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Preface 


It is less than five years since the last edition of 
Modern Physical Metallurgy was enlarged to include 
the related subject of Materials Science and Engi- 
neering, appearing under the title Metals and Mate- 
rials: Science, Processes, Applications. In its revised 
approach, it covered a wider range of metals and 
alloys and included ceramics and glasses, polymers 
and composites, modem alloys and surface engineer- 
ing. Each of these additional subject areas was treated 
on an individual basis as well as against unifying 
background theories of stmcture, kinetics and phase 
transformations, defects and materials characteriza- 
tion. 

In the relatively short period of time since that 
previous edition, there have been notable advances 
in the materials science and engineering of biomat- 
erials and sports equipment. Two new chapters have 
now been devoted to these topics. The subject of 
biomaterials concerns the science and application of 
materials that must function effectively and reliably 
whilst in contact with living tissue; these vital mat- 
erials feature increasingly in modern surgery, medicine 
and dentistry. Materials developed for sports equip- 
ment must take into account the demands peculiar 
to each sport. In the process of writing these addi- 
tional chapters, we became increasingly conscious 
that engineering aspects of the book were coming 
more and more into prominence. A new form of 
title was deemed appropriate. Finally, we decided 
to combine the phrase ‘physical metallurgy’, which 
expresses a sense of continuity with earlier edi- 
tions, directly with ‘materials engineering’ in the 
book’s title. 


Overall, as in the previous edition, the book aims to 
present the science of materials in a relatively concise 
form and to lead naturally into an explanation of the 
ways in which various important materials are pro- 
cessed and applied. We have sought to provide a useful 
survey of key materials and their interrelations, empha- 
sizing, wherever possible, the underlying scientific and 
engineering principles. Throughout we have indicated 
the manner in which powerful tools of characteriza- 
tion, such as optical and electron microscopy, X-ray 
diffraction, etc. are used to elucidate the vital relations 
between the structure of a material and its mechani- 
cal, physical and/or chemical properties. Control of the 
microstructure/property relation recurs as a vital theme 
during the actual processing of metals, ceramics and 
polymers; production procedures for ostensibly dissim- 
ilar materials frequently share common principles. 

We have continued to try and make the subject 
area accessible to a wide range of readers. Sufficient 
background and theory is provided to assist students 
in answering questions over a large part of a typical 
Degree course in materials science and engineering. 
Some sections provide a background or point of entry 
for research studies at postgraduate level. For the more 
general reader, the book should serve as a useful 
introduction or occasional reference on the myriad 
ways in which materials are utilized. We hope that 
we have succeeded in conveying the excitement of 
the atmosphere in which a life-altering range of new 
materials is being conceived and developed. 

R. E. Smallman 
R. J. Bishop 



Chapter 1 


The structure and bonding of atoms 


1.1 The realm of materials science 

In everyday life we encounter a remarkable range of 
engineering materials: metals, plastics and ceramics 
are some of the generic terms that we use to describe 
them. The size of the artefact may be extremely small, 
as in the silicon microchip, or large, as in the welded 
steel plate construction of a suspension bridge. We 
acknowledge that these diverse materials are quite lit- 
erally the stuff of our civilization and have a deter- 
mining effect upon its character, just as cast iron did 
during the Industrial Revolution. The ways in which 
we use, or misuse, materials will obviously also influ- 
ence its future. We should recognize that the pressing 
and interrelated global problems of energy utilization 
and environmental control each has a substantial and 
inescapable ‘materials dimension’. 

The engineer is primarily concerned with the func- 
tion of the component or structure, frequently with 
its capacity to transmit working stresses without risk 
of failure. The secondary task, the actual choice 
of a suitable material, requires that the materials 
scientist should provide the necessary design data, 
synthesize and develop new materials, analyse fail- 
ures and ultimately produce material with the desired 
shape, form and properties at acceptable cost. This 
essential collaboration between practitioners of the 
two disciplines is sometimes expressed in the phrase 
‘Materials Science and Engineering (MSE)\ So far 
as the main classes of available materials are con- 
cerned, it is initially useful to refer to the type of 
diagram shown in Figure 1.1. The principal sectors 
represent metals, ceramics and polymers. All these 
materials can now be produced in non-crystalline 
forms, hence a glassy ‘core’ is shown in the diagram. 
Combining two or more materials of very different 
properties, a centuries-old device, produces important 
composite materials: carbon-fibre-reinforced polymers 
(CFRP) and metal-matrix composites (MMC) are mod- 
em examples. 



1983 ). 


Adjectives describing the macroscopic behaviour of 
materials naturally feature prominently in any lan- 
guage. We write and speak of materials being hard, 
strong, brittle, malleable, magnetic, wear-resistant, etc. 
Despite their apparent simplicity, such terms have 
depths of complexity when subjected to scientific 
scmtiny, particularly when attempts are made to relate 
a given property to the internal structure of a material. 
In practice, the search for bridges of understanding 
between macroscopic and microscopic behaviour is a 
central and recurrent theme of materials science. Thus 
Sorby’s metallurgical studies of the structure/property 
relations for commercial irons and steel in the late 
nineteenth century are often regarded as the beginning 
of modern materials science. In more recent times, the 
enhancement of analytical techniques for characteriz- 
ing stmctures in fine detail has led to the development 
and acceptance of polymers and ceramics as trustwor- 
thy engineering materials. 






2 Modern Physical Metallurgy and Materials Engineering 


Having outlined the place of materials science in 
our highly material-dependent civilization, it is now 
appropriate to consider the smallest structural entity in 
materials and its associated electronic states. 

1.2 The free atom 

1.2.1 The four electron quantum numbers 

Rutherford conceived the atom to be a positively- 
charged nucleus, which carried the greater part of the 
mass of the atom, with electrons clustering around it. 
He suggested that the electrons were revolving round 
the nucleus in circular orbits so that the centrifugal 
force of the revolving electrons was just equal to the 
electrostatic attraction between the positively-charged 
nucleus and the negatively-charged electrons. In order 
to avoid the difficulty that revolving electrons should, 
according to the classical laws of electrodynamics, 
emit energy continuously in the form of electromag- 
netic radiation, Bohr, in 1913, was forced to conclude 
that, of all the possible orbits, only certain orbits were 
in fact permissible. These discrete orbits were assumed 
to have the remarkable property that when an elec- 
tron was in one of these orbits, no radiation could take 
place. The set of stable orbits was characterized by the 
criterion that the angular momenta of the electrons in 
the orbits were given by the expression nh/27i, where 
h is Planck’s constant and n could only have integral 
values {n = 1, 2, 3, etc.). In this way, Bohr was able to 
give a satisfactory explanation of the line spectrum of 
the hydrogen atom and to lay the foundation of modern 
atomic theory. 

In later developments of the atomic theory, by de 
Broglie, Schrodinger and Heisenberg, it was realized 
that the classical laws of particle dynamics could not be 
applied to fundamental particles. In classical dynamics 
it is a prerequisite that the position and momentum of 
a particle are known exactly: in atomic dynamics, if 
either the position or the momentum of a fundamental 
particle is known exactly, then the other quantity 
cannot be determined. In fact, an uncertainty must 
exist in our knowledge of the position and momentum 
of a small particle, and the product of the degree of 
uncertainty for each quantity is related to the value 
of Planck’s constant (h = 6.6256 x 10 -34 J s). In the 
macroscopic world, this fundamental uncertainty is 
too small to be measurable, but when treating the 
motion of electrons revolving round an atomic nucleus, 
application of Heisenberg’s Uncertainty Principle is 
essential. 

The consequence of the Uncertainty Principle is that 
we can no longer think of an electron as moving in 
a fixed orbit around the nucleus but must consider 
the motion of the electron in terms of a wave func- 
tion. This function specifies only the probability of 
finding one electron having a particular energy in the 
space surrounding the nucleus. The situation is fur- 
ther complicated by the fact that the electron behaves 
not only as if it were revolving round the nucleus 


but also as if it were spinning about its own axis. 
Consequently, instead of specifying the motion of an 
electron in an atom by a single integer n, as required 
by the Bohr theory, it is now necessary to specify 
the electron state using four numbers. These numbers, 
known as electron quantum numbers, are n, /, m and 
s, where n is the principal quantum number, / is the 
orbital (azimuthal) quantum number, m is the magnetic 
quantum number and s is the spin quantum number. 
Another basic premise of the modem quantum theory 
of the atom is the Pauli Exclusion Principle. This states 
that no two electrons in the same atom can have the 
same numerical values for their set of four quantum 
numbers. 

If we are to understand the way in which the 
Periodic Table of the chemical elements is built up 
in terms of the electronic structure of the atoms, 
we must now consider the significance of the four 
quantum numbers and the limitations placed upon 
the numerical values that they can assume. The most 
important quantum number is the principal quantum 
number since it is mainly responsible for determining 
the energy of the electron. The principal quantum 
number can have integral values beginning with n = 1, 
which is the state of lowest energy, and electrons 
having this value are the most stable, the stability 
decreasing as n increases. Electrons having a principal 
quantum number n can take up integral values of 
the orbital quantum number / between 0 and (n — 1). 
Thus if n = 1, / can only have the value 0, while for 
n = 2, / = 0 or 1, and for n = 3, / = 0, 1 or 2. The 
orbital quantum number is associated with the angular 
momentum of the revolving electron, and determines 
what would be regarded in non-quantum mechanical 
terms as the shape of the orbit. For a given value of 
n , the electron having the lowest value of / will have 
the lowest energy, and the higher the value of /, the 
greater will be the energy. 

The remaining two quantum numbers m and s are 
concerned, respectively, with the orientation of the 
electron’s orbit round the nucleus, and with the ori- 
entation of the direction of spin of the electron. For a 
given value of /, an electron may have integral values 
of the inner quantum number m from +/ through 0 
to — /. Thus for / = 2, m can take on the values +2, 
+ 1, 0, —1 and —2. The energies of electrons having 
the same values of n and / but different values of 
m are the same, provided there is no magnetic field 
present. When a magnetic field is applied, the energies 
of electrons having different m values will be altered 
slightly, as is shown by the splitting of spectral lines in 
the Zeeman effect. The spin quantum number s may, 
for an electron having the same values of n, / and m, 
take one of two values, that is, or — The fact 
that these are non-integral values need not concern us 
for the present purpose. We need only remember that 
two electrons in an atom can have the same values 
for the three quantum numbers n, l and m, and that 
these two electrons will have their spins oriented in 
opposite directions. Only in a magnetic field will the 
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Table 1.1 Allocation of states in the first three quantum shells 


Shell 

n 

/ 

m 

5 

Number of 
states 

Maximum number 
of electrons in shell 

1st 







K 

1 

0 

0 

±1/2 

Two 1 5-states 

2 



0 

0 

±1/2 

Two 25-states 


2nd 



+ 1 

±1/2 


8 

L 

2 

1 

0 

±1/2 

Six 2/7-states 





-1 

±1/2 





0 

0 

±1/2 

Two 35-states 


3rd 



+1 

±1/2 



M 


1 

0 

±1/2 

Six 3/7-states 





-1 

±1/2 




3 





18 




+2 

±1/2 






+ 1 

±1/2 





2 

0 

±1/2 

Ten 3d-states 





-1 

±1/2 






-2 

±1/2 




energies of the two electrons of opposite spin be dif- 
ferent. 

1.2.2 Nomenclature for the electronic states 

Before discussing the way in which the periodic clas- 
sification of the elements can be built up in terms of 
the electronic structure of the atoms, it is necessary 
to outline the system of nomenclature which enables 
us to describe the states of the electrons in an atom. 
Since the energy of an electron is mainly determined 
by the values of the principal and orbital quantum 
numbers, it is only necessary to consider these in our 
nomenclature. The principal quantum number is sim- 
ply expressed by giving that number, but the orbital 
quantum number is denoted by a letter. These letters, 
which derive from the early days of spectroscopy, are 
5, /?, d and /, which signify that the orbital quantum 
numbers / are 0, 1, 2 and 3, respectively. 1 

When the principal quantum number n = 1, / must 
be equal to zero, and an electron in this state would 
be designated by the symbol Is. Such a state can 
only have a single value of the inner quantum number 
m = 0, but can have values of + \ or — ^ for the spin 
quantum number s. It follows, therefore, that there 
are only two electrons in any one atom which can 
be in a 1 5-state, and that these electrons will spin in 
opposite directions. Thus when n = 1, only 5-states 

! The letters, 5, p, d and / arose from a classification of 
spectral lines into four groups, termed sharp, principal, 
diffuse and fundamental in the days before the present 
quantum theory was developed. 


can exist and these can be occupied by only two 
electrons. Once the two 1 5- states have been filled, 
the next lowest energy state must have n = 2. Here 
/ may take the value 0 or 1, and therefore electrons 
can be in either a 2s-ov a 2 /7-state. The energy of 
an electron in the 25-state is lower than in a 2/7- 
state, and hence the 25-states will be filled first. Once 
more there are only two electrons in the 2s- state, and 
indeed this is always true of 5-states, irrespective of the 
value of the principal quantum number. The electrons 
in the /7-state can have values of m = +1, 0, — 1, 
and electrons having each of these values for m can 
have two values of the spin quantum number, leading 
therefore to the possibility of six electrons being in 
any one /7-state. These relationships are shown more 
clearly in Table 1.1. 

No further electrons can be added to the state for 
n = 2 after two 2s- and six 2/7-state are filled, and 
the next electron must go into the state for which 
n = 3, which is at a higher energy. Here the possibility 
arises for / to have the values 0, 1 and 2 and hence, 
besides s- and /7-states, d-states for which / = 2 can 
now occur. When / = 2, m may have the values 
+2, +1,0, —1, —2 and each may be occupied by two 
electrons of opposite spin, leading to a total of ten d- 
states. Finally, when n = 4, / will have the possible 
values from 0 to 4, and when / = 4 the reader may 
verify that there are fourteen 4/-states. 

Table 1 . 1 shows that the maximum number of elec- 
trons in a given shell is 2 n 1 . It is accepted practice to 
retain an earlier spectroscopic notation and to label the 
states for which n = 1, 2, 3, 4, 5, 6 as K-, L-, M- N-, 
O- and P-shells, respectively. 
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1.3 The Periodic Table 

The Periodic Table provides an invaluable classifi- 
cation of all chemical elements, an element being a 
collection of atoms of one type. A typical version is 
shown in Table 1.2. Of the 107 elements which appear, 
about 90 occur in nature; the remainder are produced 
in nuclear reactors or particle accelerators. The atomic 
number (Z) of each element is stated, together with 
its chemical symbol, and can be regarded as either 
the number of protons in the nucleus or the num- 
ber of orbiting electrons in the atom. The elements 
are naturally classified into periods (horizontal rows), 
depending upon which electron shell is being filled, 
and groups (vertical columns). Elements in any one 
group have the electrons in their outermost shell in the 
same configuration, and, as a direct result, have similar 
chemical properties. 

The building principle ( Aufbauprinzip ) for the Table 
is based essentially upon two rules. First, the Pauli 
Exclusion Principle (Section 1.2.1) must be obeyed. 
Second, in compliance with Hund’s rule of max- 
imum multiplicity, the ground state should always 
develop maximum spin. This effect is demonstrated 
diagrammatically in Figure 1.2. Suppose that we sup- 
ply three electrons to the three ‘empty’ 2/?-orbitals. 
They will build up a pattern of parallel spins (a) rather 
than paired spins (b). A fourth electron will cause 
pairing (c). Occasionally, irregularities occur in the 
‘filling’ sequence for energy states because electrons 
always enter the lowest available energy state. Thus, 
45-states, being at a lower energy level, fill before the 
3d-states. 

We will now examine the general process by which 
the Periodic Table is built up, electron by electron, in 
closer detail. The progressive filling of energy states 
can be followed in Table 1.3. The first period com- 
mences with the simple hydrogen atom which has a 
single proton in the nucleus and a single orbiting elec- 
tron (Z = 1). The atom is therefore electrically neu- 
tral and for the lowest energy condition, the electron 
will be in the 1 5-state. In helium, the next element, 
the nucleus charge is increased by one proton and 
an additional electron maintains neutrality (Z = 2). 
These two electrons fill the 1 5-state and will nec- 
essarily have opposite spins. The nucleus of helium 
contains two neutrons as well as two protons, hence 


its mass is four times greater than that of hydrogen. 
The next atom, lithium, has a nuclear charge of three 
(Z = 3) and, because the first shell is full, an electron 
must enter the 25-state which has a somewhat higher 
energy. The electron in the 25-state, usually referred 
to as the valency electron, is ‘shielded’ by the inner 
electrons from the attracting nucleus and is therefore 
less strongly bonded. As a result, it is relatively easy 
to separate this valency electron. The ‘electron core’ 
which remains contains two tightly-bound electrons 
and, because it carries a single net positive charge, 
is referred to as a monovalent cation. The overall pro- 
cess by which electron(s) are lost or gained is known 
as ionization. 

The development of the first short period from 
lithium (Z = 3) to neon (Z = 10) can be conveniently 
followed by referring to Table 1.3. So far, the sets of 
states corresponding to two principal quantum num- 
bers (n = 1 , n = 2) have been filled and the electrons 
in these states are said to have formed closed shells. It 
is a consequence of quantum mechanics that, once a 
shell is filled, the energy of that shell falls to a very low 
value and the resulting electronic configuration is very 
stable. Thus, helium, neon, argon and krypton are asso- 
ciated with closed shells and, being inherently stable 
and chemically unreactive, are known collectively as 
the inert gases. 

The second short period, from sodium (Z = 1 1 ) to 
argon (Z = 18), commences with the occupation of 
the 35-orbital and ends when the 3/?-orbitals are full 
(Table 1.3). The long period which follows extends 
from potassium (Z = 19) to krypton (Z = 36), and, as 
mentioned previously, has the unusual feature of the 
45-state filling before the 3d-state. Thus, potassium has 
a similarity to sodium and lithium in that the electron 
of highest energy is in an 5-state; as a consequence, 
they have very similar chemical reactivities, forming 
the group known as the alkali-metal elements. After 
calcium (Z = 20), filling of the 3d-state begins. 

The 45-state is filled in calcium (Z = 20) and 
the filling of the 3d-state becomes energetically 
favourable to give scandium (Z = 21). This belated 
filling of the five 3d-orbitals from scandium to its 
completion in copper (Z = 29) embraces the first 
series of transition elements. One member of this 
series, chromium (Z = 24), obviously behaves in an 
unusual manner. Applying Hund’ s rule, we can reason 


/I 


/I 


/I 


Correct (3e) 

(a) 



Incorrect (3e) 

(b) 



Figure 1.2 Application of Hund’s multiplicity rule to the electron-filling of energy states. 




Table 1.2 The Periodic Table of the elements (from Puddephatt and 


1234 5 6 789 


IA 

IIA 

IIIA 

IVA 

VA 

VIA 

VIIA 

VIII 

iH 

1.008 









3 Li 

6.941 

4 Be 

9.012 








nNa 

22.99 

i 2 Mg 

24.31 








19K 

39.10 

20 Ca 
40.08 

21 Sc 
44.96 

22 Ti 
47.90 

23V 

50.94 

24 Cr 
52.00 

25 Mn 

54.94 

26Fe 

55.85 

27C0 

58.93 

37 Rb 

85.47 

OO u> 

39Y 

88.91 

40 Zr 
91.22 

4lNb 

92.91 

42 Mo 

95.94 

43 TC 

98.91 

44 Ru 

101.1 

45 Rh 

102.9 

55CS 

132.9 

56La 

137.3 

57 La 
138.9 

72 Hf 

178.5 

73 Ta 

180.9 

74W 

183.9 

75 Re 
186.2 

76 Os 

190.2 

77 Ir 

192.2 

svFr 

(223) 

88 R a 
(226.0) 

89AC 

(227) 

i 04 Unq 

losUnp 

i 06 Unh 

i 07 Uns 



5-block < d -block 


Lanthanides 

57 La 

138.9 

ssCe 

140.1 

59 Rr 

140.9 

60 Nd 

144.2 

6i Pm 

(147) 

62 Sm 
150.4 

Actinides 

89 AC 

90 Th 

9 iPa 

92 U 

93 Np 

94PU 
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Table 1.3 Electron quantum numbers (Hume-Rothery, Smallman and Haworth, 1988) 


Element 












52 Te 

2 

8 

18 

2 

6 

10 


2 

4 




and 












53 I 

2 

8 

18 

2 

6 

10 


2 

5 




atomic 












54 Xe 

2 

8 

18 

2 

6 

10 


2 

6 




number 

Principal and secondary quantum numbers 


55 Cs 

2 

8 

18 

2 

6 

10 


2 

6 


1 














56 Ba 

2 

8 

18 

2 

6 

10 


2 

6 


2 


n = 

1 

2 


3 



4 





57 La 

2 

8 

18 

2 

6 

10 


2 

6 

1 

2 


l = 

0 

0 

1 

0 

1 

2 

0 

1 

2 

3 


58 Ce 

2 

8 

18 

2 

6 

10 

2 

2 

6 


2 














59 Pr 

2 

8 

18 

2 

6 

10 

3 

2 

6 


2 


1 H 

1 











60 Nd 

2 

8 

18 

2 

6 

10 

4 

2 

6 


2 


2 He 

2 











61 Pm 

2 

8 

18 

2 

6 

10 

5 

2 

6 


2 


3 Li 

2 

1 










62 Sm 

2 

8 

18 

2 

6 

10 

6 

2 

6 


2 


4 Be 

2 

2 










63 Eu 

2 

8 

18 

2 

6 

10 

7 

2 

6 


2 


5 B 

2 

2 

1 









64 Gd 

2 

8 

18 

2 

6 

10 

7 

2 

6 

1 

2 


6 C 

2 

2 

2 









65 Tb 

2 

8 

18 

2 

6 

10 

9 

2 

6 


2 


7 N 

2 

2 

3 









66 Dy 

2 

8 

18 

2 

6 

10 

10 

2 

6 


2 


8 O 

2 

2 

4 









67 Ho 

2 

8 

18 

2 

6 

10 

11 

2 

6 


2 


9 F 

2 

2 

5 









68 Er 

2 

8 

18 

2 

6 

10 

12 

2 

6 


2 


10 Ne 

2 

2 

6 









69 Tm 

2 

8 

18 

2 

6 

10 

13 

2 

6 


2 


11 Na 

2 

2 

6 

1 








70 Yb 

2 

8 

18 

2 

6 

10 

14 

2 

6 


2 


12 Mg 

2 

2 

6 

2 








71 Lu 

2 

8 

18 

2 

6 

10 

14 

2 

6 

1 

2 


13 A\ 

2 

2 

6 

2 

1 







72 Hf 

2 

8 

18 

2 

6 

10 

14 

2 

6 

2 

2 


14 Si 

2 

2 

6 

2 

2 








— 












15 P 

2 

2 

6 

2 

3 







n = 

1 

2 

3 

4 

5 




6 



7 

16 S 

2 

2 

6 

2 

4 







l =- 

— 

— 

— 

— 

0 

1 

2 

3 

0 

1 

2 

0 

17 Cl 

2 

2 

6 

2 

5 








— 












18 A 

2 

2 

6 

2 

6 







73 Ta 

2 

8 

18 

32 

2 

6 

3 


2 




19 K 

2 

2 

6 

2 

6 


1 





74 W 

2 

8 

18 

32 

2 

6 

4 


2 




20 Ca 

2 

2 

6 

2 

6 


2 





75 Re 

2 

8 

18 

32 

2 

6 

5 


2 




21 Sc 

2 

2 

6 

2 

6 

1 

2 





76 Os 

2 

8 

18 

32 

2 

6 

6 


2 




22 Ti 

2 

2 

6 

2 

6 

2 

2 





77 Ir 

2 

8 

18 

32 

2 

6 

7 


2 




23 V 

2 

2 

6 

2 

6 

3 

2 





78 Pt 

2 

8 

18 

32 

2 

6 

9 


1 




24 Cr 

2 

2 

6 

2 

6 

5 

1 





79 Au 

2 

8 

18 

32 

2 

6 

10 


1 




25 Mn 

2 

2 

6 

2 

6 

5 

2 





80 Hg 

2 

8 

18 

32 

2 

6 

10 


2 




26 Fe 

2 

2 

6 

2 

6 

6 

2 





81 Tl 

2 

8 

18 

32 

2 

6 

10 


2 

1 



27 Co 

2 

2 

6 

2 

6 

7 

2 





82 Pb 

2 

8 

18 

32 

2 

6 

10 


2 

2 



28 Ni 

2 

2 

6 

2 

6 

8 

2 





83 Bi 

2 

8 

18 

32 

2 

6 

10 


2 

3 



29 Cu 

2 

2 

6 

2 

6 

10 

1 





84 Po 

2 

8 

18 

32 

2 

6 

10 


2 

4 



30 Zn 

2 

2 

6 

2 

6 

10 

2 





85 At 

2 

8 

18 

32 

2 

6 

10 


2 

5 



31 Ga 

2 

2 

6 

2 

6 

10 

2 

1 




86 Rn 

2 

8 

18 

32 

2 

6 

10 


2 

6 



32 Ge 

2 

2 

6 

2 

6 

10 

2 

2 




87 Fr 

2 

8 

18 

32 

2 

6 

10 


2 

6 


1 

33 As 

2 

2 

6 

2 

6 

10 

2 

3 




88 Ra 

2 

8 

18 

32 

2 

6 

10 


2 

6 


2 

34 Se 

2 

2 

6 

2 

6 

10 

2 

4 




89 Ac 

2 

8 

18 

32 

2 

6 

10 


2 

6 

1 

2 

35 Br 

2 

2 

6 

2 

6 

10 

2 

5 




90 Th 

2 

18 

8 

32 

2 

6 

10 


2 

6 

2 

2 

36 Kr 

2 

2 

6 

2 

6 

10 

2 

6 




91 Pa 

2 

18 

8 

32 

2 

6 

10 

2 

2 

6 

1 

2 













92 U 

2 

18 

8 

32 

2 

6 

10 

3 

2 

6 

1 

2 

n = 

1 

2 

3 

4 




5 



6 

93 Np 

2 

18 

8 

32 

2 

6 

10 

4 

2 

6 

1 

2 

l =- 

— 

— 

— 

0 

1 

2 

3 

0 

1 

2 

0 

94 Pu 

2 

18 

8 

32 

2 

6 

10 

5 

2 

6 

1 

2 


37 Rb 

2 

8 

18 

2 

6 


1 


The exact electronic configurations of the later elements 

38 Sr 

2 

8 

18 

2 

6 


2 


are not always certain but the most probable arrangements 

39 Y 

2 

8 

18 

2 

6 

1 

2 


of the outer electrons are: 

40 Zr 

2 

8 

18 

2 

6 

2 

2 




41 Nb 

2 

8 

18 

2 

6 

4 

1 


95 Am 

(5/) 7 (7s) 2 

42 Mo 

2 

8 

18 

2 

6 

5 

1 


96 Cm 

(5f) 1 (6d) l (ls) 2 

43 Tc 

44 Ru 

45 Rh 

46 Pd 

2 

2 

2 

2 

8 

8 

8 

8 

18 

18 

18 

18 

2 

2 

2 

2 

6 

6 

6 

6 

5 

7 

8 

10 

2 

1 

1 


97 Bk 

98 Cf 

99 Es 

(5f)\6d)\ls) 2 

(5/) 1( W 

(5/) n (7s) 2 

47 Ag 

2 

8 

18 

2 

6 

10 

1 


100 Fm 

(5/) 12 (7s) 2 

48 Cd 

2 

8 

18 

2 

6 

10 

2 


101 Md 

(5/) 13 (7s) 2 

49 In 

2 

8 

18 

2 

6 

10 

2 

1 

102 No 

(5/) 14 (7s) 2 

50 Sn 

2 

8 

18 

2 

6 

10 

2 

2 

103 Lw 

(5f) u m l (ls) 2 

51 Sb 

2 

8 

18 

2 

6 

10 

2 

3 

104 — 

(5f) u (6d) 2 0s) 2 



The structure and bonding of atoms 7 


that maximization of parallel spin is achieved by 
locating six electrons, of like spin, so that five fill 
the 3d-states and one enters the ^-state. This mode 
of fully occupying the 3d-states reduces the energy 
of the electrons in this shell considerably. Again, in 
copper (Z = 29), the last member of this transition 
series, complete filling of all 3d-orbitals also produces 
a significant reduction in energy. It follows from these 
explanations that the 3d- and 4^-levels of energy are 
very close together. After copper, the energy states fill 
in a straightforward manner and the first long period 
finishes with krypton (Z = 36). It will be noted that 
lanthanides (Z = 57 to 71) and actinides (Z = 89 to 
103), because of their state-filling sequences, have 
been separated from the main body of Table 1.2. 
Having demonstrated the manner in which quantum 
rules are applied to the construction of the Periodic 
Table for the first 36 elements, we can now examine 
some general aspects of the classification. 

When one considers the small step difference 
of one electron between adjacent elements in the 
Periodic Table, it is not really surprising to find 
that the distinction between metallic and non-metallic 
elements is imprecise. In fact there is an intermediate 
range of elements, the metalloids, which share the 
properties of both metals and non-metals. However, 
we can regard the elements which can readily lose an 
electron, by ionization or bond formation, as strongly 
metallic in character (e.g. alkali metals). Conversely, 
elements which have a strong tendency to acquire an 
electron and thereby form a stable configuration of 
two or eight electrons in the outermost shell are non- 
metallic (e.g. the halogens fluorine, chlorine, bromine, 
iodine). Thus electropositive metallic elements and 
the electronegative non-metallic elements lie on the 
left- and right-hand sides of the Periodic Table, 
respectively. As will be seen later, these and other 
aspects of the behaviour of the outermost (valence) 
electrons have a profound and determining effect upon 
bonding and therefore upon electrical, magnetic and 
optical properties. 

Prior to the realization that the frequently observed 
periodicities of chemical behaviour could be expressed 
in terms of electronic configurations, emphasis was 
placed upon ‘atomic weight’. This quantity, which 
is now referred to as relative atomic mass, increases 
steadily throughout the Periodic Table as protons 
and neutrons are added to the nuclei. Atomic mass 1 
determines physical properties such as density, spe- 
cific heat capacity and ability to absorb electromag- 
netic radiation: it is therefore very relevant to engi- 
neering practice. For instance, many ceramics are 
based upon the light elements aluminium, silicon and 
oxygen and consequently have a low density, i.e. 
<3000 kg m -3 . 

Atomic mass is now expressed relative to the datum value 
for carbon (12.01). Thus, a copper atom has 63.55/12.01 or 
5.29 times more mass than a carbon atom. 


1.4 Interatomic bonding in materials 

Matter can exist in three states and as atoms change 
directly from either the gaseous state (desublimation) 
or the liquid state (solidification) to the usually 
denser solid state, the atoms form aggregates in three- 
dimensional space. Bonding forces develop as atoms 
are brought into proximity to each other. Sometimes 
these forces are spatially-directed. The nature of the 
bonding forces has a direct effect upon the type of 
solid structure which develops and therefore upon 
the physical properties of the material. Melting point 
provides a useful indication of the amount of thermal 
energy needed to sever these interatomic (or interionic) 
bonds. Thus, some solids melt at relatively low 
temperatures (m.p. of tin = 232°C) whereas many 
ceramics melt at extremely high temperatures (m.p. of 
alumina exceeds 2000° C). It is immediately apparent 
that bond strength has far-reaching implications in all 
fields of engineering. 

Customarily we identify four principal types of 
bonding in materials, namely, metallic bonding, ionic 
bonding, covalent bonding and the comparatively 
much weaker van der Waals bonding. However, in 
many solid materials it is possible for bonding to be 
mixed, or even intermediate, in character. We will first 
consider the general chemical features of each type of 
bonding; in Chapter 2 we will examine the resultant 
disposition of the assembled atoms (ions) in three- 
dimensional space. 

As we have seen, the elements with the most pro- 
nounced metallic characteristics are grouped on the 
left-hand side of the Periodic Table (Table 1.2). In 
general, they have a few valence electrons, outside 
the outermost closed shell, which are relatively easy 
to detach. In a metal, each ‘free’ valency electron is 
shared among all atoms, rather than associated with an 
individual atom, and forms part of the so-called ‘elec- 
tron gas’ which circulates at random among the regular 
array of positively-charged electron cores, or cations 
(Figure 1.3a). Application of an electric potential gra- 
dient will cause the ‘gas’ to drift though the structure 
with little hindrance, thus explaining the outstanding 
electrical conductivity of the metallic state. The metal- 
lic bond derives from the attraction between the cations 
and the free electrons and, as would be expected, repul- 
sive components of force develop when cations are 
brought into close proximity. However, the bonding 
forces in metallic structures are spatially non-directed 
and we can readily simulate the packing and space- 
filling characteristics of the atoms with modelling sys- 
tems based on equal-sized spheres (polystyrene balls, 
even soap bubbles). Other properties such as ductility, 
thermal conductivity and the transmittance of electro- 
magnetic radiation are also directly influenced by the 
non-directionality and high electron mobility of the 
metallic bond. 

The ionic bond develops when electron(s) are trans- 
ferred from atoms of active metallic elements to atoms 
of active non-metallic elements, thereby enabling each 
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(a) Sodium (Z = 11) 



(c) Carbon (Z = 6) 


Figure 1.3 Schematic representation of (a) metallic bonding, 
bonding. 



(b) Magnesium (Z = 12) 
and oxygen (Z = 8) 



(d) Polarized atoms 

ionic bonding, (c) covalent bonding and (d) van der Waals 


of the resultant ions to attain a stable closed shell. 
For example, the ionic structure of magnesia (MgO), 
a ceramic oxide, forms when each magnesium atom 
(Z = 12) loses two electrons from its L-shell (n = 2) 
and these electrons are acquired by an oxygen atom 
(Z = 8), producing a stable octet configuration in its 
L-shell (Table 1.3). Overall, the ionic charges balance 
and the structure is electrically neutral (Figure 1.3b). 
Anions are usually larger than cations. Ionic bonding 
is omnidirectional, essentially electrostatic in charac- 
ter and can be extremely strong; for instance, magnesia 
is a very useful refractory oxide (m.p. = 2930°C). At 
low to moderate temperatures, such structures are elec- 
trical insulators but, typically, become conductive at 
high temperatures when thermal agitation of the ions 
increases their mobility. 

Sharing of valence electrons is the key feature of 
the third type of strong primary bonding. Covalent 
bonds form when valence electrons of opposite spin 
from adjacent atoms are able to pair within overlapping 
spatially-directed orbitals, thereby enabling each atom 
to attain a stable electronic configuration (Figure 1.3 c). 


Being oriented in three-dimensional space, these local- 
ized bonds are unlike metallic and ionic bonds. Fur- 
thermore, the electrons participating in the bonds are 
tightly bound so that covalent solids, in general, have 
low electrical conductivity and act as insulators, some- 
times as semiconductors (e.g. silicon). Carbon in the 
form of diamond is an interesting prototype for cova- 
lent bonding. Its high hardness, low coefficient of ther- 
mal expansion and very high melting point (3300°C) 
bear witness to the inherent strength of the cova- 
lent bond. First, using the (8 - N) Rule, in which 
N is the Group Number 1 in the Periodic Table, we 
deduce that carbon (Z = 6) is tetravalent; that is, four 
bond-forming electrons are available from the L-shell 
(n = 2). In accordance with Hund’s Rule (Figure 1.2), 
one of the two electrons in the 2s-state is promoted to a 
higher 2 /7-state to give a maximum spin condition, pro- 
ducing an overall configuration of Is 2 2 s l 2 p 3 in the 
carbon atom. The outermost second shell accordingly 

According to previous IUPAC notation: see top of 
Table 1.2. 
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has four valency electrons of like spin available for 
pairing. Thus each carbon atom can establish electron- 
sharing orbitals with four neighbours. For a given 
atom, these four bonds are of equal strength and are 
set at equal angles (109.5°) to each other and therefore 
exhibit tetrahedral symmetry. (The structural conse- 
quences of this important feature will be discussed in 
Chapter 2.) 

This process by which ^-orbitals and /7-orbitals 
combine to form projecting hybrid 577 -orbitals is known 
as hybridization. It is observed in elements other than 
carbon. For instance, tri valent boron (Z = 5) forms 
three co-planar sp 2 -orbitals. In general, a large degree 
of overlap of 577 -orbitals and/or a high electron density 
within the overlap ‘cloud’ will lead to an increase 
in the strength of the covalent bond. As indicated 
earlier, it is possible for a material to possess more than 
one type of bonding. For example, in calcium silicate 
(Ca 2 Si0 4 ), calcium cations Ca 2+ are ionically bonded 
to tetrahedral SiC> 4 4- clusters in which each silicon 
atom is covalently -bonded to four oxygen neighbours. 

The final type of bonding is attributed to the van- 
der Waals forces which develop when adjacent atoms, 
or groups of atoms, act as electric dipoles. Suppose 
that two atoms which differ greatly in size combine to 
form a molecule as a result of covalent bonding. The 
resultant electron ‘cloud’ for the whole molecule can 
be pictured as pear-shaped and will have an asymmet- 
rical distribution of electron charge. An electric dipole 
has formed and it follows that weak directed forces 
of electrostatic attraction can exist in an aggregate 
of such molecules (Figure 1.3d). There are no ‘free’ 
electrons hence electrical conduction is not favoured. 
Although secondary bonding by van der Waals forces 
is weak in comparison to the three forms of primary 
bonding, it has practical significance. For instance, 
in the technologically-important mineral talc, which 
is hydrated magnesium silicate Mg 3 Si 40 io(OH) 2 , the 
parallel covalently-bonded layers of atoms are attracted 
to each other by van der Waals forces. These layers can 
easily be slid past each other, giving the mineral its 
characteristically slippery feel. In thermoplastic poly- 
mers, van der Waals forces of attraction exist between 
the extended covalently-bonded hydrocarbon chains; a 
combination of heat and applied shear stress will over- 
come these forces and cause the molecular chains to 
glide past each other. To quote a more general case, 
molecules of water vapour in the atmosphere each 
have an electric dipole and will accordingly tend to 
be adsorbed if they strike solid surfaces possessing 
attractive van der Waals forces (e.g. silica gel). 

1.5 Bonding and energy levels 

If one imagines atoms being brought together uni- 
formly to form, for example, a metallic structure, 
then when the distance between neighbouring atoms 
approaches the interatomic value the outer electrons 
are no longer localized around individual atoms. Once 


the outer electrons can no longer be considered to be 
attached to individual atoms but have become free to 
move throughout the metal then, because of the Pauli 
Exclusion Principle, these electrons cannot retain the 
same set of quantum numbers that they had when they 
were part of the atoms. As a consequence, the free 
electrons can no longer have more than two electrons 
of opposite spin with a particular energy. The energies 
of the free electrons are distributed over a range which 
increases as the atoms are brought together to form 
the metal. If the atoms when brought together are to 
form a stable metallic structure, it is necessary that the 
mean energy of the free electrons shall be lower than 
the energy of the electron level in the free atom from 
which they are derived. Figure 1.4 shows the broaden- 
ing of an atomic electron level as the atoms are brought 
together, and also the attendant lowering of energy of 
the electrons. It is the extent of the lowering in mean 
energy of the outer electrons that governs the stability 
of a metal. The equilibrium spacing between the atoms 
in a metal is that for which any further decrease in the 
atomic spacing would lead to an increase in the repul- 
sive interaction of the positive ions as they are forced 
into closer contact with each other, which would be 
greater than the attendant decrease in mean electron 
energy. 

In a metallic structure, the free electrons must, 
therefore, be thought of as occupying a series of 
discrete energy levels at very close intervals. Each 
atomic level which splits into a band contains the same 
number of energy levels as the number N of atoms 
in the piece of metal. As previously stated, only two 
electrons of opposite spin can occupy any one level, so 
that a band can contain a maximum of 2 N electrons. 
Clearly, in the lowest energy state of the metal all the 
lower energy levels are occupied. 

The energy gap between successive levels is not 
constant but decreases as the energy of the levels 
increases. This is usually expressed in terms of the 
density of electronic states N(E) as a function of the 
energy E. The quantity N(E)dE gives the number of 



Figure 1.4 Broadening of atomic energy levels in a metal. 
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energy levels in a small energy interval d E, and for 
free electrons is a parabolic function of the energy, as 
shown in Figure 1.5. 

Because only two electrons can occupy each level, 
the energy of an electron occupying a low-energy 
level cannot be increased unless it is given sufficient 
energy to allow it to jump to an empty level at the 
top of the band. The energy 1 width of these bands is 
commonly about 5 or 6 eV and, therefore, considerable 
energy would have to be put into the metal to excite 
a low-lying electron. Such energies do not occur at 
normal temperatures, and only those electrons with 
energies close to that of the top of the band (known 




Energy — 

(a) (b) 

Figure 1.5 (a) Density of energy levels plotted against 
energy; (b) filling of energy levels by electrons at absolute 
zero. At ordinary temperatures some of the electrons are 
thermally excited to higher levels than that corresponding to 
E max as shown by the broken curve in (a). 

! An electron volt is the kinetic energy an electron acquires 
in falling freely through a potential difference of 1 volt 
(1 eV = 1.602 x 1(T 19 J; 1 eV per 
particle = 23 050 x 4.186 J per mol of particles). 


as the Fermi level and surface) can be excited, and 
therefore only a small number of the free electrons 
in a metal can take part in thermal processes. The 
energy of the Fermi level E F depends on the number 
of electrons N per unit volume V, and is given by 
(h 2 /8m)(3N/nV) 2 ' 3 . 

The electron in a metallic band must be thought 
of as moving continuously through the structure with 
an energy depending on which level of the band it 
occupies. In quantum mechanical terms, this motion 
of the electron can be considered in terms of a wave 
with a wavelength which is determined by the energy 
of the electron according to de Broglie’s relationship 
X = h/mv, where h is Planck’s constant and m and v 
are, respectively, the mass and velocity of the moving 
electron. The greater the energy of the electron, the 
higher will be its momentum mv, and hence the smaller 
will be the wavelength of the wave function in terms 
of which its motion can be described. Because the 
movement of an electron has this wave-like aspect, 
moving electrons can give rise, like optical waves, to 
diffraction effects. This property of electrons is used 
in electron microscopy (Chapter 5). 
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Chapter 2 

Atomic arrangements in materials 


2.1 The concept of ordering 

When attempting to classify a material it is useful to 
decide whether it is crystalline (conventional metals 
and alloys), non-crystalline (glasses) or a mixture of 
these two types of structure. The critical distinction 
between the crystalline and non-crystalline states 
of matter can be made by applying the concept 
of ordering. Figure 2.1a shows a symmetrical two- 
dimensional arrangement of two different types of 


atom. A basic feature of this aggregate is the nesting of 
a small atom within the triangular group of three much 
larger atoms. This geometrical condition is called 
short-range ordering. Furthermore, these triangular 
groups are regularly arranged relative to each other 
so that if the aggregate were to be extended, we 
could confidently predict the locations of any added 
atoms. In effect, we are taking advantage of the long- 
range ordering characteristic of this array. The array 
of Figure 2.1a exhibits both short- and long-range 



Figure 2.1 Atomic ordering in (a) crystals and (b) glasses of the same composition (from Kingery, Bowen and Uhlmann, 
1976; by permission of Wiley -Inter science). 
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ordering and is typical of a single crystal. In the other 
array of Figure 2.1b, short-range order is discernible 
but long-range order is clearly absent. This second type 
of atomic arrangement is typical of the glassy state. 1 

It is possible for certain substances to exist in 
either crystalline or glassy forms (e.g. silica). From 
Figure 2.1 we can deduce that, for such a substance, 
the glassy state will have the lower bulk density. 
Furthermore, in comparing the two degrees of ordering 
of Figures 2.1a and 2.1b, one can appreciate why the 
structures of comparatively highly-ordered crystalline 
substances, such as chemical compounds, minerals and 
metals, have tended to be more amenable to scientific 
investigation than glasses. 

2.2 Crystal lattices and structures 

We can rationalize the geometry of the simple repre- 
sentation of a crystal structure shown in Figure 2.1a 
by adding a two-dimensional frame of reference, or 
space lattice, with line intersections at atom centres. 
Extending this process to three dimensions, we can 
construct a similar imaginary space lattice in which 
triple intersections of three families of parallel equidis- 
tant lines mark the positions of atoms (Figure 2.2a). 
In this simple case, three reference axes (x, y, z) are 
oriented at 90° to each other and atoms are ‘shrunk’, 
for convenience. The orthogonal lattice of Figure 2.2a 
defines eight unit cells, each having a shared atom at 
every comer. It follows from our recognition of the 
inherent order of the lattice that we can express the 

! The terms glassy, non-crystalline, vitreous and amorphous 
are synonymous. 


geometrical characteristics of the whole crystal, con- 
taining millions of atoms, in terms of the size, shape 
and atomic arrangement of the unit cell, the ultimate 
repeat unit of structure. 2 

We can assign the lengths of the three cell 
parameters ( a , b , c ) to the reference axes, using an 
internationally- accepted notation (Figure 2.2b). Thus, 
for the simple cubic case portrayed in Figure 2.2a, x = 
y = z = 90°; a = b = c. Economizing in symbols, we 
only need to quote a single cell parameter ( a ) for the 
cubic unit cell. By systematically changing the angles 
(a, /3, y) between the reference axes, and the cell 
parameters (a, b, c), and by four skewing operations, 
we derive the seven crystal systems (Figure 2.3). Any 
crystal, whether natural or synthetic, belongs to one 
or other of these systems. From the premise that 
each point of a space lattice should have identical 
surroundings, Bravais demonstrated that the maximum 
possible number of space lattices (and therefore unit 
cells) is 14. It is accordingly necessary to augment 
the seven primitive (P) cells shown in Figure 2.3 with 
seven more non-primitive cells which have additional 
face-centring, body-centring or end-centring lattice 
points. Thus the highly- symmetrical cubic system has 
three possible lattices: primitive (P), body-centred (I; 
from the German word innenzentrierte) and face- 
centred (F). We will encounter the latter two again in 
Section 2.5.1. True primitive space lattices, in which 

2 The notion that the striking external appearance of crystals 
indicates the existence of internal structural units with 
similar characteristics of shape and orientation was 
proposed by the French mineralogist Hauy in 1784. Some 
130 years elapsed before actual experimental proof was 
provided by the new technique of X-ray diffraction analysis. 




~L 


Figure 2.2 Principles of lattice construction. 
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System 

Axes 

Axial angles 

Cubic 

a 1 = a 2 = a 3 

All angles = 90 c 

Tetragonal 

a, = a 2 * c 

All angles = 90° 

Orthohombic 

a^b^c 

All angles = 90° 

Monoclinic 

a * b * c 

Two angles = 90°; 1 angle * 90° 

Triclinic 

a * b * c 

All angles different; none equal 90 c 

Hexagonal 

a, = a ? = a 3 * c 

Angles = 90° and 1 20° 

Rhombohedral 

a, = a 2 = a 3 

All angles equal, but not 90° 


Figure 2.3 The seven systems of crystal symmetry (S = skew operation ). 


each lattice point has identical surroundings, can 
sometimes embody awkward angles. In such cases it 
is common practice to use a simpler orthogonal non- 
primitive lattice which will accommodate the atoms of 
the actual crystal structure . 1 

lattices are imaginary and limited in number; crystal 
structures are real and virtually unlimited in their variety. 


2.3 Crystal directions and planes 

In a structurally-disordered material, such as fully- 
annealed silica glass, the value of a physical property 
is independent of the direction of measurement; the 
material is said to be isotropic. Conversely, in many 
single crystals, it is often observed that a structurally- 
sensitive property, such as electrical conductivity, is 
strongly direction-dependent because of variations in 
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Figure 2.4 Indexing of (a) directions and (b) planes in cubic crystals. 


the periodicity and packing of atoms. Such crystals 
are anisotropic. We therefore need a precise method 
for specifying a direction, and equivalent directions, 
within a crystal. The general method for defining a 
given direction is to construct a line through the origin 
parallel to the required direction and then to deter- 
mine the coordinates of a point on this line in terms 
of cell parameters ( a , b , c). Hence, in Figure 2.4a, 

the direction aS is obtained by noting the transla- 
tory movements needed to progress from the origin O 
to point C, i.e. a = l, b = l, c = l. These coordinate 
values are enclosed in square brackets to give the direc- 
tion indices [1 1 1]. In similar fashion, the direction D& 
can be shown to be [1/2 1 1] with the bar sign indi- 
cating use of a negative axis. Directions which are 
crystallographically equivalent in a given crystal are 
represented by angular brackets. Thus, (100) repre- 
sents all cube edge directions and comprises [100], 
[010], [001], [TOO], [OlO] and [OOl] directions. 
Directions are often represented in non-specific terms 
as [uvw] and (uvw). 

Physical events and transformations within crystals 
often take place on certain families of parallel equidis- 
tant planes. The orientation of these planes in three- 
dimensional space is of prime concern; their size and 
shape is of lesser consequence. (Similar ideas apply to 
the corresponding external facets of a single crystal.) 
In the Miller system for indexing planes, the intercepts 
of a representative plane upon the three axes (x, y, z) 
are noted. 1 Intercepts are expressed relatively in terms 
of a , b, c. Planes parallel to an axis are said to intercept 
at infinity. Reciprocals of the three intercepts are taken 
and the indices enclosed by round brackets. Hence, in 

! For mathematical reasons, it is advisable to carry out all 
indexing operations (translations for directions, intercepts 
for planes) in the strict sequence a, b, c. 


Figure 2.4b, the procedural steps for indexing the plane 
ABC are: 

a b c 

Intercepts 11 1 

Reciprocals \ \ \ 

Miller indices (111) 

The Miller indices for the planes DEFG and BCHI are 
(0 1 0) and (1 10), respectively. Often it is necessary 
to ignore individual planar orientations and to specify 
all planes of a given crystallographic type, such as 
the planes parallel to the six faces of a cube. These 
planes constitute a crystal form and have the same 
atomic configurations; they are said to be equivalent 
and can be represented by a single group of indices 
enclosed in curly brackets, or braces. Thus, {100} 
represents a form of six planar orientations, i.e. (100), 
(0 1 0), (00 1), (1 00), (0 1 0) and (001). Returning to 
the (1 1 1) plane ABC of Figure 2.4b, it is instructive 
to derive the other seven equivalent planes, centring on 
the origin O, which comprise {111}. It will then be 
seen why materials belonging to the cubic system often 
crystallize in an octahedral form in which octahedral 
{111} planes are prominent. 

It should be borne in mind that the general purpose 
of the Miller procedure is to define the orientation of 
a family of parallel equidistant planes; the selection 
of a convenient representative plane is a means to this 
end. For this reason, it is permissible to shift the origin 
provided that the relative disposition of a , b and c is 
maintained. Miller indices are commonly written in the 
symbolic form ( hkl ). Rationalization of indices, either 
to reduce them to smaller numbers with the same ratio 
or to eliminate fractions, is unnecessary. This often- 
recommended step discards information; after all, there 
is a real difference between the two families of planes 
(100) and (200). 
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Figure 2.5 Prismatic, basal and pyramidal planes in hexagonal structures. 


As mentioned previously, it is sometimes conve- 
nient to choose a non-primitive cell. The hexagonal 
structure cell is an important illustrative example. For 
reasons which will be explained, it is also appropri- 
ate to use a four- axis Miller-Bravais notation (hkil) 
for hexagonal crystals, instead of the three-axis Miller 
notation ( hkl ). In this alternative method, three axes 
(fli, a 2 , af) are arranged at 120° to each other in a 
basal plane and the fourth axis (c) is perpendicular 
to this plane (Figure 2.5a). Hexagonal structures are 
often compared in terms of the axial ratio c/a. The 
indices are determined by taking intercepts upon the 
axes in strict sequence. Thus the procedural steps for 
the plane ABCD, which is one of the six prismatic 
planes bounding the complete cell, are: 


a\ 

Intercepts 1 

Reciprocals \ 

Miller-Bravais indices 


a 2 a 3 c 

— 1 oo oo 

0 0 

(iToo) 


Comparison of these digits with those from other pris- 
matic planes such as (lOlO), (OlTO) and (llOO) 
immediately reveals a similarity; that is, they are crys- 
tallographically equivalent and belong to the {1010} 
form. The three-axis Miller method lacks this advan- 
tageous feature when applied to hexagonal structures. 
For geometrical reasons, it is essential to ensure that 
the plane indices comply with the condition (h + k) = 
—i. In addition to the prismatic planes, basal planes 
of (0001) type and pyramidal planes of the (1 121) 


type are also important features of hexagonal structures 
(Figure 2.5b). 

The Miller-Bravais system also accommodates 
directions, producing indices of the form [uvtw]. The 
first three translations in the basal plane must be 
carefully adjusted so that the geometrical condition 
u + v= —t applies. This adjustment can be facilitated 
by sub-dividing the basal planes into triangles 
(Figure 2.6). As before, equivalence is immediately 



Figure 2.6 Typical Miller-Bravais directions in (0001) 
basal plane of hexagonal crystal. 
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revealed; for instance, the close-packed directions in 
the basal plane have the indices [2 1 10], [1120], 
[1210], etc. and can be represented by (2110). 

2.4 Stereographic projection 

Projective geometry makes it possible to represent the 
relative orientation of crystal planes and directions 
in three-dimensional space in a more convenient 
two-dimensional form. The standard stereographic 
projection is frequently used in the analysis of crystal 
behaviour; X-ray diffraction analyses usually provide 
the experimental data. Typical applications of the 
method are the interpretation of strain markings on 
crystal surfaces, portrayal of symmetrical relationships, 
determination of the axial orientations in a single 
crystal and the plotting of property values for 
anisotropic single crystals. (The basic method can also 
be adapted to produce a pole figure diagram which can 
show preferred orientation effects in polycrystalline 
aggregates.) 

A very small crystal of cubic symmetry is assumed 
to be located at the centre of a reference sphere, as 
shown in Figure 2.7a, so that the orientation of a crys- 
tal plane, such as the (111) plane marked, may be 
represented on the surface of the sphere by the point 
of intersection, or pole, of its normal P. The angle 0 
between the two poles (001) and (1 1 1), shown in 
Figure 2.7b, can then be measured in degrees along 
the arc of the great circle between the poles P and P r . 
To represent all the planes in a crystal in this three- 
dimensional way is rather cumbersome; in the stereo- 
graphic projection, the array of poles which represents 
the various planes in the crystal is projected from the 
reference sphere onto the equatorial plane. The pattern 
of poles projected on the equatorial, or primitive, plane 
then represents the stereographic projection of the crys- 
tal. As shown in Figure 2.7c, poles in the northern half 
of the reference sphere are projected onto the equa- 
torial plane by joining the pole P to the south pole 
S, while those in the southern half of the reference 
sphere, such as Q, are projected in the same way in the 
direction of the north pole N. Figure 2.8a shows the 


stereographic projection of some simple cubic planes, 
{1 00), {1 1 0} and {1 1 1), from which it can be seen 
that those crystallographic planes which have poles in 
the southern half of the reference sphere are repre- 
sented by circles in the stereogram, while those which 
have poles in the northern half are represented by dots. 

As shown in Figure 2.7b, the angle between two 
poles on the reference sphere is the number of degrees 
separating them on the great circle passing through 
them. The angle between P and P' can be determined 
by means of a hemispherical transparent cap graduated 
and marked with meridian circles and latitude circles, 
as in geographical work. With a stereographic rep- 
resentation of poles, the equivalent operation can be 
performed in the plane of the primitive circle by using 
a transparent planar net, known as a Wulff net. This net 
is graduated in intervals of 2°, with meridians in the 
projection extending from top to bottom and latitude 
lines from side to side. 1 Thus, to measure the angular 
distance between any two poles in the stereogram, the 
net is rotated about the centre until the two poles lie 
upon the same meridian, which then corresponds to 
one of the great circles of the reference sphere. The 
angle between the two poles is then measured as the 
difference in latitude along the meridian. Some useful 
crystallographic rules may be summarized: 

1. The Weiss Zone Law: the plane (hkl) is a member 
of the zone [uvw] if hu + kv + Iw = 0. A set of 
planes which all contain a common direction [uvw] 
is known as a zone; [uvw] is the zone axis (rather 
like the spine of an open book relative to the flat 
leaves). For example, the three planes (1 1 0), (Ol 1) 
and (101) form a zone about the [111] direction 
(Figure 2.8a). The pole of each plane containing 
[uvw] must lie at 90° to [uvw]; therefore these three 
poles all lie in the same plane and upon the same 
great circle trace. The latter is known as the zone 
circle or zone trace. A plane trace is to a plane as 
a zone circle is to a zone. Uniquely, in the cubic 

! A less-used alternative to the Wulff net is the polar net, in 
which the N-S axis of the reference sphere is perpendicular 
to the equatorial plane of projection. 



Figure 2.7 Principles of stereo graphic projection, illustrating (a) the pole P to a (1 1 1) plane, (b) the angle between two 
poles, P, P' and (c) stereographic projection of P and P' poles to the (1 1 1) and (001) planes, respectively . 
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Figure 2.8 Projections of planes in cubic crystals: (a) standard (001) stereographic projection and (b) spherical projection. 


system alone, zone circles and plane traces with the 
same indices lie on top of one another. 

2. If a zone contains (h\k\l\) and ( h 2 k 2 l 2 ) it 
also contains any linear combination of them, 
e.g. m(h\kili) + n(h 2 k 2 l 2 )- For example, the 
zone [111] contains (110) and (Oil) and it 
must therefore contain (1 TO) + (0 1 1) = (1 Ol), 
(110) + 2(0 1 1) = (1 1 2), etc. The same is true 
for different directions in a zone, provided that the 
crystal is cubic. 

3. The Law of Vector Addition: the direction 
[u\ViW\] + \u 2 V 2 W 2 ] lies between [u\V\W\\ and 
[, W2V2W2] . 

4. The angle between two directions is given by: 

U\U2 + V\ V 2 + W\W2 

cos 0 = — 

\J[(u] + v\ + w\)(u\ + v\ + wj)] 

where u\v x w\ and u 2 v 2 w 2 are the indices for the 
two directions. Provided that the crystal system is 
cubic, the angles between planes may be found by 
substituting the symbols h , k , / and for u , v, w in 
this expression. 

When constructing the standard stereogram of any 
crystal it is advantageous to examine the symmetry 
elements of that structure. As an illustration, consider 
a cubic crystal, since this has the highest symme- 
try of any crystal class. Close scrutiny shows that 
the cube has thirteen axes of symmetry; these axes 
comprise three fourfold (tetrad) axes, four threefold 
(triad) axes and six twofold (diad) axes, as indicated in 
Figure 2.9a. (This diagram shows the standard square, 
triangular and lens- shaped symbols for the three types 
of symmetry axis.) An n-fold axis of symmetry oper- 
ates in such a way that after rotation through an angle 
2jT/n, the crystal comes into an identical or self- 
coincident position in space. Thus, a tetrad axis passes 
through the centre of each face of the cube parallel to 
one of the edges, and a rotation of 90° in either direc- 
tion about one of these axes turns the cube into a new 



(1 of 6) 



Figure 2.9 Some elements of symmetry for the cubic system; 
total number of elements = 23 . 


position which is crystallographically indistinguishable 
from the old position. Similarly, the cube diagonals 
form a set of four threefold axes, and each of the lines 
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passing through the centre of opposite edges form a set 
of six twofold symmetry axes. Some tetrad, triad and 
diad axes are marked on the spherical projection of a 
cubic crystal shown in Figure 2.8b. The cube also has 
nine planes of symmetry (Figure 2.9b) and one centre 
of symmetry, giving, together with the axes, a total of 
23 elements of symmetry. 

In the stereographic projection of Figure 2.8a, 
planes of symmetry divide the stereogram into 24 
equivalent spherical triangles, commonly called unit 
triangles, which correspond to the 48 (24 on the top 
and 24 on the bottom) seen in the spherical projection. 
The two-, three- and fourfold symmetry about the 
{110}, {111} and {100} poles, respectively, is 
apparent. It is frequently possible to analyse a problem 
in terms of a single unit triangle. Finally, reference 
to a stereogram (Figure 2.8a) confirms rule (2) which 
states that the indices of any plane can be found 
merely by adding simple multiples of other planes 
which lie in the same zone. For example, the (Oil) 
plane lies between the (001) and (010) planes and 
clearly 011 = 001+010. Owing to the action of the 
symmetry elements, it can be reasoned that there must 
be a total of 12 {0 1 1} planes because of the respective 
three- and fourfold symmetry about the {111} and 
{100} axes. As a further example, it is clear that the 
(112) plane lies between the (001) plane and (111) 
plane since 112 = 001 + 111 and that the {1 1 2} 
form must contain 24 planes, i.e. a icositetrahedron. 
The plane (12 3), which is an example of the most 
general crystal plane in the cubic system because its 
hkl indices are all different, lies between (112) and 
(Oil) planes; the 48 planes of the {12 3} form make 
up a hexak-isoctahedron. 

The tetrahedral form, a direct derivative of the 
cubic form, is often encountered in materials science 
(Figure 2.10a). Its symmetry elements comprise four 
triad axes, three diad axes and six ‘mirror’ planes, as 
shown in the stereogram of Figure 2.10b. 

Concepts of symmetry, when developed systemat- 
ically, provide invaluable help in modem structural 
analysis. As already implied, there are three basic ele- 
ments, or operations, of symmetry. These operations 
involve translation (movement along parameters a , b, 
c), rotation (about axes to give diads, triads, etc.) and 
reflection (across ‘mirror’ planes). Commencing with 
an atom (or group of atoms) at either a lattice point or 
at a small group of lattice points, a certain combina- 
tion of symmetry operations will ultimately lead to the 
three-dimensional development of any type of crystal 
structure. The procedure provides a unique identifying 
code for a stmcture and makes it possible to locate 
it among 32 point groups and 230 space groups of 
symmetry. This classification obviously embraces the 
seven crystal systems. Although many metallic struc- 
tures can be defined relatively simply in terms of space 
lattice and one or more lattice constants, complex 
structures require the key of symmetry theory. 




2.5 Selected crystal structures 

2.5.1 Pure metals 

We now examine the crystal structures of various 
elements (metallic and non-metallic) and compounds, 
using examples to illustrate important structure- 
building principles and structure/property relations. 1 
Most elements in the Periodic Table are metallic in 
character; accordingly, we commence with them. 

Metal ions are relatively small, with diameters in the 
order of 0.25 nm. A millimetre cube of metal therefore 
contains about 10 20 atoms. The like ions in pure solid 
metal are packed together in a highly regular manner 
and, in the majority of metals, are packed so that ions 
collectively occupy the minimum volume. Metals are 
normally crystalline and for all of them, irrespective 
of whether the packing of ions is close or open, it 

1 Where possible, compound structures of engineering 
importance have been selected as illustrative examples. 
Prototype structures, such as NaCl, ZnS, CaF 2 , etc., which 
appear in standard treatments elsewhere, are indicated as 
appropriate. 
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Figure 2.11 Arrangement of atoms in (a) face-centred cubic structure, (b) close-packed hexagonal structure, and 
(c) body-centred cubic structure. 


is possible to define and express atomic arrangements 
in terms of structure cells (Section 2.2). Furthermore, 
because of the non-directional nature of the metallic 
bond, it is also possible to simulate these arrangements 
by simple 4 hard- sphere’ modelling. 

There are two ways of packing spheres of equal 
size together so that they occupy the minimum vol- 
ume. The structure cells of the resulting arrangements, 
face-centred cubic (fee) and close-packed hexagonal 
(eph), are shown in Figures 2.1 la and 2.1 lb. The other 
structure cell (Figure 2.11c) has a body-centred cubic 
(bcc) arrangement; although more ‘open’ and not based 
on close-packing, it is nevertheless adopted by many 
metals. 

In order to specify the structure of a particular metal 
completely, it is necessary to give not only the type 
of crystal structure adopted by the metal but also the 
dimensions of the structure cell. In cubic structure cells 
it is only necessary to give the length of an edge a , 
whereas in a hexagonal cell the two parameters a and 
c must be given, as indicated in Figures 2.11a-c. If a 
hexagonal structure is ideally close-packed, the ratio 
cl a must be 1.633. In hexagonal metal structures, the 
axial ratio cl a is never exactly 1.633. These structures 
are, therefore, never quite ideally closed-packed, e.g. 
cl a (Zn) = 1.856, c/a(Ti) = 1.587. As the axial ratio 
approaches unity, the properties of eph metals begin 
to show similarities to fee metals. 

A knowledge of cell parameters permits the atomic 
radius r of the metal atoms to be calculated on the 
assumption that they are spherical and that they are 
in closest possible contact. The reader should verify 
that in the fee structure r = (ay/ 2)/4 and in the bcc 
structure r = (ay/3)/ 4, where a is the cell parameter. 


The coordination number (CN), an important con- 
cept in crystal analysis, is defined as the number of 
nearest equidistant neighbouring atoms around any 
atom in the crystal structure. Thus, in the bcc struc- 
ture shown in Figure 2.11c the atom at the centre of 
the cube in surrounded by eight equidistant atoms, 
i.e. CN = 8. It is perhaps not so readily seen from 
Figure 2.11a that the coordination number for the fee 
structure is 12. Perhaps the easiest method of visu- 
alizing this is to place two fee cells side by side, 
and then count the neighbours of the common face- 
centring atom. In the eph structure with ideal packing 
(c/a = 1.633) the coordination number is again 12, as 
can be seen by once more considering two cells, one 
stacked on top of the other, and choosing the centre 
atom of the common basal plane. This (0 001) basal 
plane has the densest packing of atoms and has the 
same atomic arrangement as the closest-packed plane 
in the fee structure. 1 

The eph and fee structures represent two effective 
methods of packing spheres closely; the difference 
between them arises from the different way in which 
the close-packed planes are stacked. Figure 2.12a 
shows an arrangement of atoms in A-sites of a close- 
packed plane. When a second plane of close-packed 
atoms is laid down, its first atom may be placed in 
either a B-site or a C-site, which are entirely equiva- 
lent. However, once the first atom is placed in one of 
these two types of site, all other atoms in the second 

! The Miller indices for the closest-packed (octahedral) 
planes of the fee structure are {1 1 1}; these planes are best 
revealed by balancing a ball-and-stick model of the fee cell 
on one corner. 
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Figure 2.12 (a) Arrangements of atoms in a close-packed plane, (b) registry of two close-packed planes, and (c) the stacking 
of successive planes. 


Table 2.1 Crystal structures of some metals at room temperature 


Element 

Crystal structure 

Closest interatomic 
distance (nm) 

Element 

Crystal structure 

Closest interatomic 
distance (nm) 

Aluminium 

fee 


0.286 

Platinum 

fee 

0.211 

Beryllium 

cph (c/a = 

1.568) 

0.223 

Potassium 

bcc 

0.461 

Cadmium 

cph (c/a = 

1.886) 

0.298 

Rhodium 

fee 

0.269 

Chromium 

bcc 


0.250 

Rubidium 

bcc 

0.494 

Cobalt 

cph (c/a = 

1.623) 

0.250 

Silver 

fee 

0.289 

Copper 

fee 


0.255 

Sodium 

bcc 

0.372 

Gold 

fee 


0.288 

Tantalum 

bcc 

0.286 

Iron 

bcc 


0.248 

Thorium 

fee 

0.360 

Lead 

fee 


0.350 

Titanium 

cph (c/a = 1.587) 

0.299 

Lithium 

bcc 


0.331 

Tungsten 

bcc 

0.274 

Magnesium 

cph (c/a = 

1.623) 

0.320 

Uranium 

orthorhombic 

0.275 

Molybdenum 

bcc 


0.275 

Vanadium 

bcc 

0.262 

Nickel 

fee 


0.249 

Zinc 

cph (c/a = 1.856) 

0.266 

Niobium 

bcc 


0.286 

Zirconium 

cph (c/a = 1.592) 

0.318 


plane must be in similar sites. (This is because neigh- 
bouring B- and C sites are too close together for both 
to be occupied in the same layer.) At this stage there 
is no difference between the cph and fee structure; the 
difference arises only when the third layer is put in 
position. In building up the third layer, assuming that 
sites of type B have been used to construct the second 
layer, as shown in Figure 2.12b, either A-sites or C- 
sites may be selected. If A-sites are chosen, then the 
atoms in the third layer will be directly above those in 
the first layer, and the structure will be cph, whereas 
if C-sites are chosen this will not be the case and the 
structure will be fee. Thus a cph structure consists of 
layers of close-packed atoms stacked in the sequence 
of ABABAB or, of course, equally well, AC AC AC. 
An fee structure has the stacking sequence ABCAB- 
CABC so that the atoms in the fourth layer lie directly 
above those in the bottom layer. The density of packing 
within structures is sometimes expressed as an atomic 
packing fraction (APF) which is the fraction of the cell 
volume occupied by atoms. The APF value for a bcc 
cell is 0.68; it rises to 0.74 for the more closely packed 
fee and cph cells. 

Table 2.1 gives the crystal structures adopted by 
some typical metals, the majority of which are either 


fee or bcc. As indicated previously, an atom does not 
have precise dimensions; however, it is convenient to 
express atomic diameters as the closest distance of 
approach between atom centres. Table 2.1 lists struc- 
tures that are stable at room temperature; at other 
temperatures, some metals undergo transition and the 
atoms rearrange to form a different crystal structure, 
each structure being stable over a definite interval of 
temperature. This phenomenon is known as allotropy. 
The best-known commercially-exploitable example is 
that of iron, which is bcc at temperatures below 910°C, 
fee in the temperature range 910-1 400° C and bcc at 
temperatures between 1400°C and the melting point 
(1535°C). Other common examples include titanium 
and zirconium which change from cph to bcc at tem- 
peratures of 882°C and 815°C, respectively, tin, which 
changes from cubic (grey) to tetragonal (white) at 
13.2°C, and the metals uranium and plutonium. Pluto- 
nium is particularly complex in that it has six different 
allotropes between room temperature and its melting 
point of 640° C. 

These transitions between allotropes are usually 
reversible and, because they necessitate rearrangement 
of atoms, are accompanied by volume changes and 
either the evolution or absorption of thermal energy. 
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The transition can be abrupt but is often sluggish. For- 
tunately, tetragonal tin can persist in a metastable state 
at temperatures below the nominal transition temper- 
ature. However, the eventual transition to the friable 
low-density cubic form can be very sudden. 1 

Using the concept of a unit cell, together with data 
on the atomic mass of constituent atoms, it is possible 
to derive a theoretical value for the density of a pure 
single crystal. The parameter a for the bcc cell of pure 
iron at room temperature is 0.286 64 nm. Hence the 
volume of the unit cell is 0.023 55 nm 3 . Contrary to 
first impressions, the bcc cell contains two atoms, i.e. 
(8 x | atom) + 1 atom. Using the Avogadro constant 
V A , 2 we can calculate the mass of these two atoms as 
2(55.85/A/a) or 185.46 x 10 -24 kg, where 55.85 is the 
relative atomic mass of iron. The theoretical density 
(mass/volume) is thus 7875 kg m -3 . The reason for 
the slight discrepancy between this value and the 
experimentally-determined value of 7870 kg m -3 will 
become evident when we discuss crystal imperfections 
in Chapter 4. 

2.5.2 Diamond and graphite 

It is remarkable that a single element, carbon, can exist 
in two such different crystalline forms as diamond 
and graphite. Diamond is transparent and one of the 

historical examples of ‘tin plague’ abound (e.g. buttons, 
coins, organ pipes, statues). 

2 The Avogadro constant Na is 0.602217 x 10 -24 mol -1 . 
The mole is a basic SI unit. It does not refer to mass and 
has been likened to terms such as dozen, score, gross, etc. 
By definition, it is the amount of substance which contains 
as many elementary units as there are atoms in 0.012 kg of 
carbon- 12. The elementary unit must be specified and may 
be an atom, a molecule, an ion, an electron, a photon, etc. 
or a group of such entities. 


hardest materials known, finding wide use, notably as 
an abrasive and cutting medium. Graphite finds general 
use as a solid lubricant and writing medium (pencil 
‘lead’). It is now often classed as a highly refractory 
ceramic because of its strength at high temperatures 
and excellent resistance to thermal shock. 

We can now progress from the earlier representation 
of the diamond structure (Figure 1.3c) to a more real- 
istic version. Although the structure consists of two 
interpenetrating fee sub-structures, in which one sub- 
structure is slightly displaced along the body diagonal 
of the other, it is sufficient for our purpose to concen- 
trate on a representative structure cell (Figure 2.13a). 
Each carbon atom is covalently bonded to four equidis- 
tant neighbours in regular tetrahedral 3 coordination 
(CN = 4). For instance, the atom marked X occupies a 
‘hole’, or interstice, at the centre of the group formed 
by atoms marked 1, 2, 3 and 4. There are eight equiv- 
alent tetrahedral sites of the X-type, arranged four- 
square within the fee cell; however, in the case of 
diamond, only half of these sites are occupied. Their 
disposition, which also forms a tetrahedron, maximizes 
the intervening distances between the four atoms. If the 
fee structure of diamond depended solely upon pack- 
ing efficiency, the coordination number would be 12; 
actually CN = 4, because only four covalent bonds can 
form. Silicon (Z = 14), germanium (Z = 32) and grey 
tin (Z = 50) are fellow-members of Group IV in the 
Periodic Table and are therefore also tetravalent. Their 
crystal structures are identical in character, but obvi- 
ously not in dimensions, to the diamond structure of 
Figure 2.13a. 

3 The stability and strength of a tetrahedral form holds a 
perennial appeal for military engineers: spiked iron caltrops 
deterred attackers in the Middle Ages and concrete 
tetrahedra acted as obstacles on fortified Normandy beaches 
in World War II. 



Figure 2.13 Two crystalline forms of carbon: (a) diamond and (b) graphite (from Kingery, Bowen and Uhlmann, 1976; by 
permission of Wiley-Interscience ) . 
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Graphite is less dense and more stable than dia- 
mond. In direct contrast to the cross-braced structure of 
diamond, graphite has a highly anisotropic layer struc- 
ture (Figure 2.13b). Adjacent layers in the ABABAB 
sequence are staggered; the structure is not cph. A 
less stable rhombohedral ABCABC sequence has been 
observed in natural graphite. Charcoal, soot and lamp- 
black have been termed ‘amorphous carbon’; actually 
they are microcrystalline forms of graphite. Covalent- 
bonded carbon atoms, 0.1415 nm apart, are arranged 
in layers of hexagonal symmetry. These layers are 
approximately 0.335 nm apart. This distance is rel- 
atively large and the interlayer forces are therefore 
weak. Layers can be readily sheared past each other, 
thus explaining the lubricity of graphitic carbon. (An 
alternative solid lubricant, molybdenum disulphide, 
MoS 2 , has a similar layered structure.). 

The ratio of property values parallel to the a - axis 
and the c-axis is known as the anisotropy ratio. (For 
cubic crystals, the ratio is unity.) Special synthesis 
techniques can produce near-ideal graphite 1 with an 
anisotropy ratio of thermal conductivity of 200. 

2.5.3 Coordination in ionic crystals 

We have seen in the case of diamond how the joining 
of four carbon atoms outlines a tetrahedron which is 
smaller than the structure cell (Figure 2.13a). Before 
examining some selected ionic compounds, it is neces- 
sary to develop this aspect of coordination more fully. 
This approach to structure-building concerns packing 
and is essentially a geometrical exercise. It is sub- 
ordinate to the more dominant demands of covalent 
bonding. 

In the first of a set of conditional rules, assembled by 
Pauling, the relative radii of cation (r) and anion ( R ) 
are compared. When electrons are stripped from the 
outer valence shell during ionization, the remaining 

applications range from rocket nozzles to bowl linings for 
tobacco pipes. 


electrons are more strongly attracted to the nucleus; 
consequently, cations are usually smaller than anions. 
Rule 1 states that the coordination of anions around 
a reference cation is determined by the geometry 
necessary for the cation to remain in contact with 
each anion. For instance, in Figure 2.14a, a radius 
ratio r/R of 0.155 signifies touching contact when 
three anions are grouped about a cation. This critical 
value is readily derived by geometry. If the r/R ratio 
for threefold coordination is less than 0.155 then the 
cation ‘rattles’ in the central interstice, or ‘hole’, and 
the arrangement is unstable. As r/R exceeds 0.155 then 
structural distortion begins to develop. 

In the next case, that of fourfold coordination, 
the ‘touching’ ratio has a value of 0.225 and 
joining of the anion centres defines a tetrahedron 
(Figure 2.14b). For example, silicon and oxygen ions 
have radii of 0.039 nm and 0.132 nm, respectively, 
hence r/R = 0.296. This value is slightly greater than 
the critical value of 0.225 and it follows that tetrahedral 
coordination gives a stable configuration; indeed, the 
complex anion SiCL 4- is the key structural feature 
of silica, silicates and silica glasses. The quadruple 
negative charge is due to the four unsatisfied oxygen 
bonds which project from the group. 

In a feature common to many structures, the 
tendency for anions to distance themselves from each 
other as much as possible is balanced by their attraction 
towards the central cation. Each of the four oxygen 
anions is only linked by one of its two bonds to 
the silicon cation, giving an effective silicon/oxygen 
ratio of 1:2 and thus confirming the stoichiometric 
chemical formula for silica, Si0 2 . Finally, as shown in 
Figure 2.14c, the next coordination polyhedron is an 
octahedron for which r/R = 0.414. It follows that each 
degree of coordination is associated with a nominal 
range of r/R values, as shown in Table 2.2. Caution 
is necessary in applying these ideas of geometrical 
packing because (1) range limits are approximative, 
(2) ionic radii are very dependent upon CN, (3) ions 
can be non-spherical in anisotropic crystals and 



r/R = 0.155 
CN = 3 

(Boric oxide B 2 0 3 ) 



(Silica Si0 2 ) 


(c) 



r/R = 0.414 
CN = 6 

(Periclase MgO) 


Figure 2.14 Nesting of cations within anionic groups. 
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Table 2.2 Relation between radius ratio and coordination 


r/R 

Maximum 
coordination 
number (CN) 

Form of 
coordination 

<0.155 

2 

Linear 

0.155-0.225 

3 

Equilateral triangle 

0.225-0.414 

4 

Regular tetrahedron 

0.414-0.732 

6 

Regular octahedron 

0.732-1.0 

8 

Cube 

1.00 

12 

Cuboctahedron 


(4) considerations of covalent or metallic bonding can 
be overriding. The other four Pauling rules are as 
follows: 

Rule II. In a stable coordinated structure the total 
valency of the anion equals the summated bond 
strengths of the valency bonds which extend to this 
anion from all neighbouring cations. Bond strength is 
defined as the valency of an ion divided by the actual 
number of bonds; thus, for Si 4+ in tetrahedral coordi- 
nation it is | = 1. This valuable rule, which expresses 
the tendency of each ion to achieve localized neutrality 
by surrounding itself with ions of opposite charge, is 
useful in deciding the arrangement of cations around 
an anion. For instance, the important ceramic barium 
titanate (BaTi0 3 ) has Ba 2+ and Ti 4+ cations bonded 
to a common 0 2 ~ anion. Given that the coordination 
numbers of O 2- polyhedra centred on Ba 2+ and Ti 4+ 
are 12 and 6, respectively, we calculate the correspond- 
ing strengths of the B a- O and Ti-0 bonds as ^ \ 

and | = | . The valency of the shared anion is 2, which 
is numerically equal to (4x^) + (2x|). Accord- 
ingly, coordination of the common oxygen anion with 
four barium cations and two titanium cations is a viable 
possibility. 

Rule III. An ionic structure tends to have maxi- 
mum stability when its coordination polyhedra share 
corners; edge- and face-sharing give less stability. Any 
arrangement which brings the mutually-repelling cen- 
tral cations closer together tends to destabilize the 
structure. Cations of high valency (charge) and low 
CN (poor ‘shielding’ by surrounding anions) aggravate 
the destabilizing tendency. 

Rule IV. In crystals containing different types of 
cation, cations of high valency and low CN tend to 
limit the sharing of polyhedra elements; for instance, 
such cations favour corner- sharing rather than edge- 
sharing. 

Rule V. If several alternative forms of coordination 
are possible, one form usually applies throughout the 
structure. In this way, ions of a given type are more 
likely to have identical surroundings. 

In conclusion, it is emphasized that the Pauling rules 
are only applicable to structures in which ionic bonding 
predominates. Conversely, any structure which fails to 
comply with the rules is extremely unlikely to be ionic. 



Figure 2.15 Zinc blende (a-ZnS) structure, prototype for 
cubic boron nitride (BN) (from Kingery, Bowen and 
Uhlmann, 1976; by permission of Wiley-lnterscience ) . 

The structure of the mineral zinc blende (a-ZnS) 
shown in Figure 2.15 is often quoted as a prototype 
for other structures. In accord with the radius ratio 
r/R = 0.074/0.184 = 0.4, tetrahedral coordination is 
a feature of its structure. Coordination tetrahedra 
share only corners (vertices). Thus one species of ion 
occupies four of the eight tetrahedral sites within the 
cell. These sites have been mentioned previously in 
connection with diamond (Section 2.5.2); in that case, 
the directional demands of the covalent bonds between 
like carbon atoms determined their location. In zinc 
sulphide, the position of unlike ions is determined by 
geometrical packing. Replacement of the Zn 2+ and 
S 2_ ions in the prototype cell with boron and nitrogen 
atoms produces the structure cell of cubic boron nitride 
(BN). This compound is extremely hard and refractory 
and, because of the adjacency of boron (Z = 5) and 
nitrogen (Z = 7) to carbon (Z = 6) in the Periodic 
Table, is more akin in character to diamond than to 
zinc sulphide. Its angular crystals serve as an excellent 
grinding abrasive for hardened steel. The precursor for 
cubic boron nitride is the more common and readily- 
prepared form, hexagonal boron nitride. 1 

This hexagonal form is obtained by replacing 
the carbon atoms in the layered graphite structure 
(Figure 2.13b) alternately with boron and nitrogen 
atoms and also slightly altering the stacking registry 
of the layer planes. It feels slippery like graphite and 

lr The process for converting hexagonal BN to cubic BN 
( Borazon ) involves very high temperature and pressure and 
was developed by Dr R. H. Wentorf at the General Electric 
Company, USA (1957). 
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is sometimes called ‘white graphite’. Unlike graphite, 
it is an insulator, having no free electrons. 

Another abrasive medium, silicon carbide (SiC), can 
be represented in one of its several crystalline forms 
by the zinc blende structure. Silicon and carbon are 
tetravalent and the coordination is tetrahedral, as would 
be expected. 

2.5.4 AB-type compounds 

An earlier diagram (Figure 1.3b) schematically por- 
trayed the ionic bonding within magnesium oxide (per- 
iclase). We can now develop a more realistic model of 
its structure and also apply the ideas of coordination. 



O = Mg 2+ O O 2 ' 

Magnesia 

MgO 

fee 

(CN = 6:6) 



CuZn 

Primitive cubic 
(CN = 8:8) 

Figure 2.16 AB-type compounds ( from Kingery, Bowen and 
Uhlmann, 1976; by permission of Wiley -Inter science ) . 


Genetically, MgO is a sodium chloride-type struc- 
ture (Figure 2.16a), with Mg 2+ cations and O 2- anions 
occupying two interpenetrating 1 fee sub-lattices. Many 
oxides and halides have this type of structure (e.g. 
CaO, SrO, BaO, VO, CdO, MnO, FeO, CoO, NiO; 
NaCl, NaBr, Nal, NaF, KC1, etc.). The ratio of ionic 
radii r/R = 0.065/0.140 = 0.46 and, as indicated by 
Table 2.2, each Mg 2+ cation is octahedrally coordi- 
nated with six larger O 2- anions, and vice versa 
(CN = 6:6). Octahedra of a given type share edges. 
The ‘molecular’ formula MgO indicates that there is 
an exact stoichiometric balance between the numbers 
of cations and anions; more specifically, the unit cell 
depicted contains (8x | ) + (6 x \) = 4 cations and 
(12 x | ) + 1 =4 anions. 

The second example of an AB-type compound 
is the hard intermetallic compound CuZn (/3-brass) 
shown in Figure 2.16b. It has a caesium chloride- 
type structure in which two simple cubic sub-lattices 
interpenetrate. Copper (Z = 29) and zinc (Z = 30) 
have similar atomic radii. Each copper atom is in 
eightfold coordination with zinc atoms; thus CN = 
8:8. The coordination cubes share faces. Each unit 
cell contains (8 x |) = 1 corner atom and 1 central 
atom; hence the formula CuZn. In other words, this 
compound contains 50 at.% copper and 50 at.% zinc. 

2.5.5 Silica 

Compounds of the AB 2 -type (stoichiometric ratio 
1:2) form a very large group comprising many 
different types of structure. We will concentrate upon 
/3-cristobalite, which, as Table 2.3 shows, is the high- 
temperature modification of one of the three principal 
forms in which silica (Si0 2 ) exists. Silica is a 
refractory ceramic which is widely used in the steel 
and glass industries. Silica bricks are prepared by kiln- 
firing quartz of low impurity content at a temperature 
of 1450°C, thereby converting at least 98.5% of it 
into a mixture of the more ‘open’, less dense forms, 
tridymite and cristobalite. The term ‘conversion’ is 
equivalent to that of allotropic transformation in 
metallic materials and refers to a transformation which 
is reconstructive in character, involving the breaking 
and re-establishment of inter-atomic bonds. These 
solid-state changes are generally rather sluggish and, 
as a consequence, crystal structures frequently persist 
in a metastable condition at temperatures outside 
the nominal ranges of stability given in Table 2.3. 
Transformations from one modification to another only 
involve displacement of bonds and reorientation of 
bond directions; they are known as inversions. As 
these changes are comparatively limited in range, 
they are usually quite rapid and reversible. However, 
the associated volume change can be substantial. For 
example, the a -> /3 transition in cristobalite at a 

1 Sub-lattices can be discerned by concentrating on each 
array of like atoms (ions) in turn. 
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Table 2.3 Principal crystalline forms of silica 


Form 

Range of stability (°C) 

Modifications 

Density (kg m 3 ) 

Cristobalite 

1470-1723 (m.p.) 

f> — (cubic) 

2210 



a — (tetragonal) 

2330 

Tridymite 

870-1470 

y— (?) 

— 



f> — (hexagonal) 

2300 



a — (orthorhombic) 

2270 

Quartz 

<870 

f> — (hexagonal) 

2600 



a — (trigonal) 

2650 


temperature of 270° C is accompanied by a volume 
increase of 3% which is capable of disrupting the 
structure of a silica brick or shape. In order to avoid 
this type of thermal stress cracking, it is necessary 
to either heat or cool silica structures very slowly at 
temperatures below 700°C (e.g. at 20°Ch _1 ). Above 
this temperature level, the structure is resilient and, as 
a general rule, it is recommended that silica refractory 
be kept above a temperature of 700° C during its 
entire working life. Overall, the structural behaviour 
of silica during kiln-firing and subsequent service is 
a complicated subject, 1 particularly as the presence 
of other substances can either catalyse or hinder 
transformations. 

Substances which promote structural change in 
ceramics are known as mineralizers (e.g. calcium 
oxide (CaO)). The opposite effect can be produced 
by associated substances in the micro structure; for 
instance, an encasing envelope of glassy material 
can inhibit the cooling inversion of a small volume 
of /3-cristobalite by opposing the associated contrac- 
tion. The pronounced metastability of cristobalite and 
tridymite at relatively low temperatures is usually 
attributed to impurity atoms which, by their pres- 
ence in the interstices, buttress these ‘open’ structures 
and inhibit conversions. However, irrespective of these 
complications, corner-sharing Si0 4 4- tetrahedra, with 
their short-range order, are a common feature of all 
these crystalline modifications of silica; the essential 
difference between modifications is therefore one of 
long-range ordering. We will use the example of the 
/3-cristobalite structure to expand the idea of these ver- 
satile tetrahedral building units. (Later we will see that 
they also act as building units in the very large family 
of silicates.) 

In the essentially ionic structure of ^-cristobalite 
(Figure 2.17) small Si 4+ cations are located in a cubic 
arrangement which is identical to that of diamond. The 
much larger 0 2 ~ anions form SiCL 4- tetrahedra around 
each of the four occupied tetrahedral sites in such a 
way that each Si 4+ lies equidistant between two anions. 

l Th e fact that cristobalite forms at a kiln-firing temperature 

which is below 1470° C illustrates the complexity of the 

structural behaviour of commercial-quality silica. 



Figure 2.17 Structure of f>- cristobalite (from Kingery, 

Bowen and Uhlmann, 1976; by permission of 
Wiley-Interscience). 

The structure thus forms a regular network of comer- 
sharing tetrahedra. The coordination of anions around 
a cation is clearly fourfold; coordination around each 
anion can be derived by applying Pauling’s Rule III. 
Thus, CN = 4:2 neatly summarizes the coordination 
in /3-cristobalite. Oxygen anions obviously occupy 
much more volume than cations and consequently their 
grouping in space determines the essential character 
of the structure. In other words, the radius ratio is 
relatively small. As the anion and cation become 
progressively more similar in size in some of the other 
AB 2 -type compounds, the paired coordination numbers 
take values of 6:3 and then 8:4. These paired values 
relate to structure groups for which rutile (Ti0 2 ) and 
fluorite (CaF 2 ), respectively, are commonly quoted 
as prototypes. AB 2 -type compounds have their alloy 
counterparts and later, in Chapter 3, we will examine 
in some detail a unique and important family of alloys 
(e.g. MgCu 2 , MgNi 2 , MgZn 2 , etc.). In these so-called 
Laves phases, two dissimilar types of atoms pack so 
closely that the usual coordination maximum of 12, 
which is associated with equal-sized atoms, is actually 
exceeded. 
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Figure 2.18 Structure of a-alumina ( corundum ) viewed 
perpendicular to (0 0 0 1) basal plane ( from Hume-Rothery, 
Smallman and Haworth, 1988). 


2.5.6 Alumina 

Alumina exists in two forms: a-A\ 2 0 3 and y- A1 2 0 3 . 
The former, often referred to by its mineral name 
corundum, serves as a prototype for other ionic oxides, 
such as (haematite), Cr 2 0 3 , V 2 O 3 , Ti 2 0 3 , 

etc. The structure of q'-A1 2 0 3 (Figure 2.18) can be 
visualized as layers of close-packed O 2- anions with 
an ABABAB . . . sequence in which two-thirds of the 
octahedral holes or interstices are filled symmetrically 
with smaller Al 3+ cations. Coordination is accordingly 
6:4. This partial filling gives the requisite stoichiomet- 
ric ratio of ions. The structure is not truly cph because 
all the octahedral sites are not filled. 

q'-A 2 0 3 is the form of greatest engineering inter- 
est. The other term, y-Al 2 0 3 , refers collectively to a 
number of variants which have O 2- anions in an fee 
arrangement. As before, Al 3+ cations fill two-thirds of 
the octahedral holes to give a structure which is con- 
veniently regarded as a ‘defect’ spinel structure with 
a deficit, or shortage, of Al 3+ cations; spinels will be 
described in Section 2.5.7. y-Al 2 0 3 has very useful 
adsorptive and catalytic properties and is sometimes 
referred to as ‘activated alumina’, illustrating yet again 
the way in which structural differences within the same 
compound can produce very different properties. 

2.5.7 Complex oxides 

The AB0 3 -type compounds, for which the mineral 
perovskite (CaTi0 3 ) is usually quoted as prototype, 
form an interesting and extremely versatile family. 
Barium titanium oxide 1 (BaTi0 3 ) has been studied 
extensively, leading to the development of impor- 
tant synthetic compounds, notably the new genera- 
tion of ceramic superconductors . 2 It is polymorphic, 

x The structure does not contain discrete Ti0 3 2- anionic 
groups; hence, strictly speaking, it is incorrect to imply that 
the compound is an inorganic salt by referring to it as 
barium ‘titanate’. 

2 K. A. Muller and J. G. Bednorz, IBM Zurich Research 
Laboratory, based their researches upon perovskite-type 
structures. In 1986 they produced a complex 
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Figure 2.19 Unit cell of cubic BaTiOs ( CN = 6:12) (from 
Kingery, Bowen and Uhlmann, 1976; by permission of 
Wiley-Interscience). 

exhibiting at least four temperature-dependent transi- 
tions. The cubic form, which is stable at temperatures 
below 120°C, is shown in Figure 2.19. The large bar- 
ium cations are located in the ‘holes’, or interstices, 
between the regularly stacked titanium-centred oxy- 
gen octahedra. Each barium cation is at the centre of 
a polyhedron formed by twelve oxygen anions. (Coor- 
dination in this structure was discussed in terms of 
Pauling’s Rule II in Section 2.5.3). 

Above the ferroelectric Curie point (120°C), the 
cubic unit cell of BaTi0 3 becomes tetragonal as 
Ti 4+ cations and O 2- anions move in opposite 
directions parallel to an axis of symmetry. This 
slight displacement of approximately 0.005 nm is 
accompanied by a change in axial ratio (c/a) from 
unity to 1.04. The new structure develops a dipole 
of electric charge as it becomes less symmetrical; it 
also exhibits marked ferroelectric characteristics. The 
electrical and magnetic properties of perovskite-type 
structures will be explored in Chapter 6 . 

Inorganic compounds with structures similar to that 
of the hard mineral known as spinel, MgAl 2 04 , form 
an extraordinarily versatile range of materials (e.g. 
watch bearings, refractories). Numerous alternative 
combinations of ions are possible. Normal versions 
of these mixed oxides are usually represented by the 
general formula AB 2 C> 4 ; however, other combinations 
of the two dissimilar cations, A and B, are also 

super-conducting oxide of lanthanum, barium and copper 
which had the unprecedentedly-high critical temperature of 
35 K. 
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possible. Terms such as II-III spinels, II-IV spinels 
and I- VI spinels have been adopted to indicate 
the valencies of the first two elements in the 
formula; respective examples being Mg 2+ Al 2 3+ 0 4 2 “, 
Mg 2 2+ Ge 4+ 04 2_ and Ag 2 1+ Mo 6+ C> 4 2 ~. In each spinel 
formula, the total cationic charge balances the negative 
charge of the oxygen anions. (Analogous series of 
compounds are formed when the divalent oxygen 
anions are completely replaced by elements from 
the same group of the Periodic Table, i.e. sulphur, 
selenium and tellurium.) 

The principle of substitution is a useful device for 
explaining the various forms of spinel structure. 

Thus, in the case of II-III spinels, the Mg 2+ cations 
of the reference spinel structure MgAl 2 0 4 can be 
replaced by Fe 2+ , Zn 2+ , Ni 2+ and Mn 2+ and virtu- 
ally any trivalent cation can replace Al 3+ ions (e.g. 
Fe 3+ , Cr 3+ , Mn 3+ , Ti 3+ , V 3+ , rare earth ions, etc.). The 
scope for extreme diversity is immediately apparent. 
The cubic unit cell, or true repeat unit, of the II- 
III prototype MgAl 2 0 4 comprises eight fee sub-cells 
and, overall, contains 32 oxygen anions in almost per- 
fect fee arrangement. The charge-compensating cations 
are distributed among the tetrahedral (CN = 4) and 
octahedral (CN = 6) interstices of these anions. (Each 
individual fee sub-cell has eight tetrahedral sites within 
it, as explained for diamond, and 12 octahedral ‘holes’ 
located midway along each of the cube edges.) One 
eighth of the 64 tetrahedral ‘holes’ of the large unit 
cell are occupied by Mg 2+ cations and one half of the 
32 octahedral ‘holes’ are occupied by Al 3+ cations. 
A similar distribution of divalent and trivalent cations 
occurs in other normal II-III spinels e.g. MgCr 2 0 4 , 
ZnCr 2 Se 4 . Most spinels are of the II-III type. 

Ferrospinels (‘ferrites’), such as NiFe 2 0 4 and 
CoFe 2 0 4 , form an ‘inverse’ type of spinel structure 
in which the allocation of cations to tetrahedral and 
octahedral sites tends to change over, producing sig- 
nificant and useful changes in physical characteristics 
(e.g. magnetic and electrical properties). The generic 
formula for ‘inverse’ spinels takes the form B(AB)0 4 , 
with the parentheses indicating the occupancy of octa- 
hedral sites by both types of cation. In this ‘inverse’ 
arrangement, B cations rather than A cations occupy 
tetrahedral sites. In the case of the two ferrospinels 
named, ‘inverse’ structures develop during slow cool- 
ing from sintering heat-treatment. In the first spinel, 
which we can now write as Fe 3+ (Ni 2+ Fe 3+ )0 4 , half of 
the Fe 3+ cations are in tetrahedral sites. The remainder, 
together with all Ni 2+ cations, enter octahedral sites. 
Typically, these compounds respond to the conditions 
of heat-treatment: rapid cooling after sintering will 
affect the distribution of cations and produce a struc- 
ture intermediate to the limiting normal and inverse 
forms. The partitioning among cation sites is often 
quantified in terms of the degree of inversion (X) which 
states the fraction of B cations occupying tetrahedral 
sites. Hence, for normal and inverse spinels respec- 
tively, 1 = 0 and 1 = 0.5. Intermediate values of 1 


between these limits are possible. Magnetite, the nav- 
igational aid of early mariners, is an inverse spinel 
and has the formula Fe 3+ (Fe 2+ Fe 3+ )0 4 and 1 = 0.5. 
Fe 3+ (Mg 2+ Fe 3+ )0 4 is known to have a 1 value of 0.45. 
Its structure is therefore not wholly inverse, but this 
formula notation does convey structural information. 
Other, more empirical, notations are sometimes used; 
for instance, this particular spinel is sometimes repre- 
sented by the formulae MgFe 2 0 4 and Mg 0 .Fe 2 03 . 

2.5.8 Silicates 

Silicate minerals are the predominant minerals in the 
earth’s crust, silicon and oxygen being the most abun- 
dant chemical elements. They exhibit a remarkable 
diversity of properties. Early attempts to classify them 
in terms of bulk chemical analysis and concepts of 
acidity/basicity failed to provide an effective and con- 
vincing frame of reference. An emphasis upon stoi- 
chiometry led to the practice of representing silicates 
by formulae stating the thermodynamic components. 
Thus two silicates which are encountered in refrac- 
tories science, forsterite and mullite, are sometimes 
represented by the ‘molecular’ formulae 2Mg0.Si0 2 
and 3Al 2 0 3 .2Si0 2 . (A further step, often adopted in 
phase diagram studies, is to codify them as M 2 S and 
A 3 S 2 , respectively.) However, as will be shown, the 
summated counterparts of the above formulae, namely 
Mg 2 Si0 4 and Al6Si 2 0i 3 , provide some indication of 
ionic grouping and silicate type. In keeping with this 
emphasis upon structure, the characterization of ceram- 
ics usually centres upon techniques such as X-ray 
diffraction analysis, with chemical analyses making a 
complementary, albeit essential, contribution. 

The Si0 4 tetrahedron previously described in the 
discussion of silica (Section 2.5.5) provides a highly 
effective key to the classification of the numerous 
silicate materials, natural and synthetic. From each of 
the four comer anions projects a bond which is satisfied 
by either (1) an adjacent cation, such as Mg 2+ , Fe 2+ , 
Fe 3+ , Ca 2+ etc., or (2) by the formation of ‘oxygen 
bridges’ between vertices of tetrahedra. In the latter 
case an increased degree of cornersharing leads from 
stmetures in which isolated tetrahedra exist to those in 
which tetrahedra are arranged in pairs, chains, sheets 
or frameworks (Table 2.4). Let us briefly consider 
some examples of this structural method of classifying 
silicates. 

In the nesosilicates, isolated Si0 4 4 ~ tetrahedra are 
studded in a regular manner throughout the stmeture. 
Zircon (zirconium silicate) has the formula ZrSi0 4 
which displays the characteristic silicon/oxygen ratio 
(1:4) of a nesosilicate. (It is used for the refractory 
kiln furniture which supports ceramic ware during 
the firing process.) The large family of nesosilicate 
minerals known as olivines has a generic formula 
(Mg, Fe) 2 Si0 4 , which indicates that the negatively- 
charged tetrahedra are balanced electrically by either 
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Table 2.4 Classification of silicate structures 


Type of silicate 

(, Si 4+ +Al 3+ ) : 0 2 ~ a 

Arrangement 
of tetrahedra b 

Examples 

Mineralogical 

name 

Chemical 

name 

Nesosilicate 

‘Orthosilicate’ 

1:4 

Isolated ZX Z\ 

Zircon, olivines, garnets 

Sorosilicate 

‘Pyrosilicate’ 

2:7 

Pairing /\Z\ 

Thortveitite 



1:3, 4:11 

Linear chains 

Amphiboles, pyroxenes 




A v A v^ 

A A / 







Inosilicate 

‘Metasilicate’ 

3:9, 6:18, etc. 

Rings 

Beryl 

Phyllosilicate 


2:5 

4 / 

Flat sheets 

Micas, kaolin, talc 

Tecto silicate 


1:2 

Framework 

l 

Feldspars, zeolites, 





ultramarines 


a Only includes A1 cations within tetrahedra. 
b A represents a tetrahedron. 


Mg 2+ or Fe 2+ cations. This substitution, or replace- 
ment, among the available cation sites of the struc- 
ture forms a solid solution. 1 This means that the 
composition of an olivine can lie anywhere between 
the compositions of the two end-members, forsterite 
(Mg 2 Si0 4 ) and fayalite (Fe 2 Si0 4 ). The difference in 
high-temperature performance of these two varieties 
of olivine is striking; white forsterite (m.p. 1890°C) 
is a useful refractory whereas brown/black fayalite 
(m.p. 1200°C), which sometimes forms by interac- 
tion between certain refractory materials and a molten 
furnace charge, is weakening and undesirable. Substi- 
tution commonly occurs in non-metallic compounds 
(e.g. spinels). Variations in its form and extent can be 
considerable and it is often found that samples can vary 
according to source, method of manufacture, etc. Sub- 
stitution involving ions of different valency is found 

1 This important mixing effect also occurs in many metallic 

alloys; an older term, ‘mixed crystal’ (from the German 
word Mischkristall ), is arguably more appropriate. 


in the dense nesosilicates known as garnets. In their 
representational formula, A 3 n B 2 in (Si 0 4 ) 3 , the divalent 
cation A can be Ca 2+ , Mg 2+ , Mn 2+ or Fe 2+ and the 
trivalent cation B can be Al 3+ , Cr 3+ , Fe 3+ , or Ti 3+ . 
(Garnet is extremely hard and is used as an abrasive.) 

Certain asbestos minerals are important examples of 
inosilicates. Their unique fibrous character, or asbesti- 
form habit, can be related to the structural disposition 
of Si0 4 4- tetrahedra. These impure forms of mag- 
nesium silicate are remarkable for their low thermal 
conductivity and thermal stability. However, all forms 
of asbestos break down into simpler components when 
heated in the temperature range 600- 1 000° C. The 
principal source materials are: 

Amosite (brown (Fe 2 2+ Mg) 7 (Si 4 On) 2 (OH) 4 

asbestos) 

Crocidolite (blue Na 2 Fe 2 3+ (Fe 2+ Mg) 3 (SLA i h (OH) 4 
asbestos) 

Chrysotile (white Mg 3 Si 2 05 ( 0 H ) 4 

asbestos) 
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These chemical formulae are idealized. Amosite and 
crocidolite belong to the amphibole group of minerals 
in which Si0 4 4- tetrahedra are arranged in double- 
strand linear chains (Table 2.4). The term (Si 4 On) 
represents the repeat unit in the chain which is four 
tetrahedra wide. Being hydrous minerals, hydroxyl 
ions (OH) - are interspersed among the tetrahedra. 
Bands of cations separate the chains and, in a rather 
general sense, we can understand why these structures 
cleave to expose characteristic thread-like fracture 
surfaces. Each thread is a bundle of solid fibrils or 
filaments, 20-200 nm in breadth. The length/diameter 
ratio varies but is typically 100:1. Amphibole fibres are 
used for high-temperature insulation and have useful 
acid resistance; however, they are brittle and inflexible 
(‘harsh’) and are therefore difficult to spin into yarn 
and weave. In marked contrast, chrysotile fibres are 
strong and flexible and have been used specifically for 
woven asbestos articles, for friction surfaces and for 
asbestos/cement composites. Chrysotile belongs to the 
serpentine class of minerals in which Si0 4 4- tetrahedra 
are arranged in sheets or layers. It therefore appears 
paradoxical for it to have a fibrous fracture. High- 
resolution electron microscopy solved the problem by 
showing that chrysotile fibrils, sectioned transversely, 
were hollow tubes in which the structural layers were 
curved and arranged either concentrically or as scrolls 
parallel to the major axis of the tubular fibril. 

Since the 1970s considerable attention has been paid 
to the biological hazards associated with the manufac- 
ture, processing and use of asbestos-containing mate- 
rials. It has proved to be a complicated and highly 
emotive subject. Minute fibrils of asbestos are readily 
airborne and can cause respiratory diseases (asbestosis) 
and cancer. Crocidolite dust is particularly dangerous. 
Permissible atmospheric concentrations and safe han- 
dling procedures have been prescribed. Encapsulation 
and/or coating of fibres is recommended. Alternative 
materials are being sought but it is difficult to match 
the unique properties of asbestos. For instance, glassy 
‘wool’ fibres have been produced on a commercial 
scale by rapidly solidifying molten rock but they do 
not have the thermal stability, strength and flexibil- 
ity of asbestos. Asbestos continues to be widely used 
by the transportation and building industries. Asbestos 
textiles serve in protective clothing, furnace curtains, 
pipe wrapping, ablative nose cones for rockets, and 
conveyors for molten glass. Asbestos is used in friction 
components, 1 gaskets, gland packings, joints, pump 
seals, etc. In composite asbestos cloth/phenolic resin 
form, it is used for bearings, bushes, liners and aero- 
engine heat shields. Cement reinforced with asbestos 
fibres is used for roofing, cladding and for pressure 
pipes which distribute potable water. 

! Dust from asbestos friction components, such as brake 
linings, pads and clutches of cars, can contain 1-2% of 
asbestos fibres and should be removed by vacuum or damp 
cloth rather than by blasts of compressed air. 


The white mineral kaolinite is an important example 
of the many complex silicates which have a layered 
structure, i.e. Si:0 = 2:5. As indicated previously, in 
the discussion of spinels, atomic grouping(s) within the 
structural formula can indicate actual structural groups. 
Thus, kaolinite is represented by Al 2 Si 2 0 5 (0H) 4 rather 
than by Al 2 0 3 .2Si0 2 .2H 2 0, an older notation which 
uses ‘waters of crystallization’ and disregards the sig- 
nificant role of hydroxyl OH - ions. Sometimes the 
formula is written as [Al 2 Si 2 C >5 (OH) 4 ] 2 in order to give 
a truer picture of the repeat cell. Kaolinite is the com- 
monest clay mineral and its small crystals form the 
major constituent of kaolin (china-clay), the rock that 
is a primary raw material of the ceramics industry. (It 
is also used for filling and coating paper.) Clays are the 
sedimentary products of the weathering of rocks and 
when one considers the possible variety of geological 
origins, the opportunities for the acquisition of impu- 
rity elements and the scope for ionic replacement it is 
not surprising to find that the compositions and struc- 
tures of clay minerals show considerable variations. 
To quote one practical instance, only certain clays, the 
so-called fireclays, are suitable for manufacture into 
refractory firebricks for furnace construction. 

Structurally, kaolinite provides a useful insight into 
the arrangement of ions in layered silicates. Essen- 
tially the structure consists of flat layers, several 
ions thick. Figure 2.20 shows, in section, adjacent 
vertically-stacked layers of kaolinite, each layer having 
five sub-layers or sheets. The lower side of each layer 
consists of Si0 4 4- tetrahedra arranged hexagonally in a 
planar net. Three of the four vertices of these tetrahedra 
are joined by ‘oxygen bridges’ and lie in the lower- 
most face; the remaining vertices all point upwards. 
The central Si 4+ cations of the tetrahedra form the sec- 
ond sub-layer. The upward-pointing vertices, together 
with OH - ions, form the close-packed third sub-layer. 
Al 3+ cations occupy some of the octahedral ‘holes’ 
(CN = 6) between this third layer and a fifth close- 
packed layer of OH - ions. The coordination of each 




Figure 2.20 Schematic representation of two layers of 
kaolinite structure (from Evans, 1966, by permission of 
Cambridge University Press). 
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aluminium cation with two oxygen ions and four 
hydroxyl ions forms an octahedron, i.e. A10 2 (0H) 4 . 
Thus, in each layer, a sheet of Si0 4 4- tetrahedra lies 
parallel to a sheet of A 102 ( 0 H) 4 octahedra, with the 
two sheets sharing common 0 2 ~ anions. Strong ionic 
and covalent bonding exists within each layer and each 
layer is electrically neutral. However, the uneven dis- 
tribution of ionic charge across the five sub-layers has a 
polarizing effect, causing opposed changes to develop 
on the two faces of the layer. The weak van der Waals 
bonding between layers is thus explicable. This asym- 
metry of ionic structure also unbalances the bonding 
forces and encourages cleavage within the layer itself. 
In general terms, one can understand the softness, easy 
cleavage and mouldability (after moistening) of this 
mineral. The ionic radii of oxygen and hydroxyl ions 
are virtually identical. The much smaller Al 3+ cations 
are shown located outside the Si0 4 4- tetrahedra. How- 
ever, the radii ratio for aluminium and oxygen ions is 
very close to the geometrical boundary value of 0.414 
and it is possible in other aluminosilicates for Al 3+ 
cations to replace Si 4+ cations at the centres of oxygen 
tetrahedra. In such structures, ions of different valency 
enter the structure in order to counterbalance the local 
decreases in positive charge. To summarize, the coor- 
dination of aluminium in layered aluminosilicates can 
be either four- or sixfold. 

Many variations in layer structure are possible in 
silicates. Thus, talc (French chalk), Mg 3 Si 4 Oi 0 (OH) 2 , 
has similar physical characteristics to kaolinite and 
finds use as a solid lubricant. In talc, each layer con- 
sists of alternating Mg 2+ and OH - ions interspersed 
between the inwardly-pointing vertices of two sheets of 
Si0 4 4 tetrahedra. This tetrahedral-tetrahedral layering 
thus contrasts with the tetrahedral-octahedral layering 
of kaolinite crystals. 

Finally, in our brief survey of silicates, we come to 
the framework structures in which the Si0 4 4- tetrahe- 
dra share all four comers and form an extended and 
regular three-dimensional network. Feldspars, which 
are major constituents in igneous rocks, are fairly com- 
pact but other framework silicates, such as the zeolites 
and ultramarine, have unusually ‘open’ structures with 
tunnels and/or polyhedral cavities. Natural and syn- 
thetic zeolites form a large and versatile family of 
compounds. As in other framework silicates, many of 
the central sites of the oxygen tetrahedra are occupied 
by Al 3+ cations. The negatively charged framework of 
(Si, A1)0 4 tetrahedra is balanced by associated cations; 
being cross-braced in three dimensions, the structure is 
rigid and stable. The overall (Al 3+ + Si 4+ ):0 2_ ratio 
is always 1:2 for zeolites. In their formulae, (H 2 0) 
appears as a separate term, indicating that these water 
molecules are loosely bound. In fact, they can be read- 
ily removed by heating without affecting the structure 
and can also be re-absorbed. Alternatively, dehydrated 
zeolites can be used to absorb gases, such as carbon 


dioxide (C0 2 ) and ammonia (NH 3 ). Zeolites are well- 
known for their ion-exchange capacity 1 but synthetic 
resins now compete in this application. Ion exchange 
can be accompanied by appreciable absorption so that 
the number of cations entering the zeolitic structure can 
actually exceed the number of cations being replaced. 

Dehydrated zeolites have a large surface/mass ratio, 
like many other catalysts, and are used to promote 
reactions in the petrochemical industry. Zeolites can 
also serve as ‘molecular sieves’. By controlling the size 
of the connecting tunnel system within the structure, it 
is possible to separate molecules of different size from 
a flowing gaseous mixture. 

2.6 Inorganic glasses 

2.6.1 Network structures in glasses 

Having examined a selection of important crystalline 
structures, we now turn to the less-ordered glassy 
structures. Boric oxide (B 2 0 3 ; m.p. 460°C) is one of 
the relatively limited number of oxides that can exist 
in either a crystalline or a glassy state. Figure 2.1, 
which was used earlier to illustrate the concept of 
ordering (Section 2.1), portrays in a schematic man- 
ner the two structural forms of boric oxide. In this 
figure, each planar triangular group (CN = 3) repre- 
sents three oxygen anions arranged around a much 
smaller B 3+ cation. Collectively, the triangles form 
a random network in three dimensions. Similar mod- 
elling can be applied to silica (m.p. 1725°C), the most 
important and common glass-forming oxide. In silica 
glass, Si0 4 4- tetrahedra form a three-dimensional net- 
work with oxygen ‘bridges’ joining vertices. Like boric 
oxide glass, the ‘open’ structure contains many ‘holes’ 
of irregular shape. The equivalent of metallic alloying 
is achieved by basing a glass upon a combination of 
two glass-formers, silica and boric oxide. The resulting 
network consists of triangular and tetrahedral anionic 
groups and, as might be anticipated, is less cohesive 
and rigid than a pure Si0 2 network. B 2 0 3 therefore 
has a fluxing action. By acting as a network-former, it 
also has less effect upon thermal expansivity than con- 
ventional fluxes, such as Na 2 0 and K 2 0, which break 
up the network. The expansion characteristics can thus 
be adjusted by control of the B 2 0 3 /Na 2 0 ratio. 

Apart from chemical composition, the main variable 
controlling glass formation from oxides is the rate of 
cooling from the molten or fused state. Slow cooling 
provides ample time for complete ordering of atoms 
and groups of atoms. Rapid cooling restricts this physi- 
cal process and therefore favours glass formation. 2 The 

1 In the Permutite water-softening system, calcium ions in 
‘hard’ water exchange with sodium ions of a zeolite (e.g. 
thomsonite, NaCa 2 (Al 5 Si 50 2 o)). Spent zeolite is readily 
regenerated by contact with brine (NaCl) solution. 

2 The two states of aggregation may be likened to a stack of 
carefully arranged bricks (crystal) and a disordered heap of 
bricks (glass). 
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American Society for Testing and Materials (ASTM) 
defines glass as an inorganic product of fusion that has 
cooled to a rigid condition without crystallizing. The 
cooling rate can be influenced by a ‘mass effect’ with 
the chances of glass formation increasing as the size 
of particle or cross-section decreases. Accordingly, a 
more precise definition of a glass-former might also 
specify a minimum mass, say 20 mg, and free-cooling 
of the melt. As a consequence of their irregular and 
aperiodic network structures, glasses share certain dis- 
tinctive characteristics. They are isotropic and have 
properties that change gradually with changing tem- 
perature. Bond strengths vary from region to region 
within the network so that the application of stress at 
an elevated temperature causes viscous deformation or 
flow. This remarkable ability to change shape without 
fracture is used to maximum advantage in the spinning, 
drawing, rolling, pressing and blowing operations of 
the glass industry (e.g. production of filaments, tubes, 
sheets, shapes and containers). Glasses do not cleave, 
because there are no crystallographic planes, and frac- 
ture to produce new surfaces that are smooth and shell- 
like (conchoidal). It is usually impossible to represent 
a glass by a stoichiometric formula. Being essentially 
metastable, the structure of a glass can change with 
the passage of time. Raising the temperature increases 
ionic mobility and hastens this process, being some- 
times capable of inducing the nucleation and growth 
of crystalline regions within the glassy matrix. Con- 
trolled devitrification of special glasses produces the 
heat- and fracture -resistant materials known as glass- 
ceramics (Section 10.4.4). Finally, glasses lack a defi- 
nite melting point. This feature is apparent when spe- 
cific volume (m 3 kg -1 ), or a volume -related property, 
is plotted against temperature for the crystalline and 
glassy forms of a given substance (Figure 2.21). On 
cooling, the melt viscosity rapidly increases. Simul- 
taneously, the specific volume decreases as a result 
of normal thermal contraction and contraction due to 
structural (configurational) rearrangement within the 
liquid. After supercooling below the crystalline melt- 
ing point, a curved inflexion over a temperature range 
of roughly 50° C marks the decrease and eventual ces- 
sation of structural rearrangement. The final portion 
of the curve, of lesser slope, represents normal ther- 
mal contraction of the rigid glass structure. The Active 
(imagined) temperature T f shown in Figure 2.21 serves 
as an index of transition; however, it increases in value 
as the cooling rate is increased. Being disordered, a 
glass has a lower density than its corresponding crys- 
talline form. 

2.6.2 Classification of constituent oxides 

After considering the relation of oxides to glass struc- 
ture, Zachariasen categorized oxides as (1) network- 
formers, (2) intermediates and (3) network-modifiers. 
Oxides other than boric oxide and silica have the 
ability to form network structures. They are listed in 
Table 2.5. 



Figure 2.21 Comparison of the formation of glass and 
crystals from a melt. 


Table 2.5 Classification of oxides in accordance with their 
ability to form glasses (after Tooley) 


Network-formers 

Intermediates 

Network-modifiers 

b 2 o 3 

ai 2 o 3 

MgO 

Si0 2 

sb 2 o 3 

Li 2 0 

Ge0 2 

Zr0 2 

BaO 

P2O5 

Ti0 2 

CaO 

v 2 o 5 

PbO 

Na 2 0 

As 2 6 3 

BeO 

SrO 


ZnO 

k 2 o 


This particular method of classification primarily 
concerns the glass-forming ability of an oxide; thus 
oxides classed as network-modifiers have little or no 
tendency to form network structures. Modifiers can 
have very important practical effects. For instance, the 
alkali-metal oxides of sodium and potassium are used 
to modify the glasses based on silica which account 
for 90% of commercial glass production. Sodium car- 
bonate (Na 2 C0 3 ) and calcium carbonate (CaC0 3 ) are 
added to the furnace charge of silica sand and cullet 
(recycled glass) and dissociate to provide the mod- 
ifying oxides, releasing carbon dioxide. Eventually, 
after melting, fining and degassing operations in which 
the temperature can ultimately reach 1500-1600°C, 
the melt is cooled to the working temperature of 
1000°C. Sodium ions become trapped in the network 
and reduce the number of ‘bridges’ between tetrahedra, 
as shown schematically in Figure 2.22a. These Na + 
cations influence ‘hole’ size and it has been proposed 
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Si 4 Na O 2 Pb 2 


Figure 2.22 Schematic representation of action of modifiers 
in silica glass, (a) Na 2 O breaking-up network; (b) PbO 
entering network. 

that they may cluster rather than distribute themselves 
randomly throughout the network. However, although 
acting as a flux, sodium oxide by itself renders the 
glass water-soluble. This problem is solved by adding a 
stabilising modifier, CaO, to the melt, a device known 
to the glassmakers of antiquity. 1 Ca 2+ ions from dis- 
sociated calcium carbonate also enter the ‘holes’ of 
the network; however, for each nonbridging O 2- anion 
generated, there will be half as many Ca 2+ ions as Na + 
ions (Figure 2.22a). 

There are certain limits to the amounts of the various 
agents that can be added. As a general rule, the glass 
network becomes unstable and tends to crystallize if 
the addition of modifier or intermediate increases the 
numerical ratio of oxygen to silicon ions above a value 
of 2.5. Sometimes the tolerance of the network for an 
added oxide can be extremely high. For instance, up 
to 90% of the intermediate, lead oxide (PbO), can be 
added to silica glass. Pb 2+ cations enter the network 
(Figure 2.22b). Glass formulations are discussed fur- 
ther in Sections 10.5 and 10.6. 

1 Extant 2000-year-old Roman vases are remarkable for their 

beauty and craftsmanship; the Portland vase, recently 

restored by the British Museum, London, and tentatively 

valued at £30 million, is a world-famous example. 


2.7 Polymeric structures 

2.7.1 Thermoplastics 

Having examined the key role of silicon in crystalline 
silicates and glasses, we now turn to another tetravalent 
element, carbon, and examine its central contribution 
to the organic structures known as polymers, or, in a 
more general commercial sense, ‘plastics’. These struc- 
tures are based upon long-chain molecules and can be 
broadly classified in behavioural terms as thermoplas- 
tics, elastomers and thermo sets. In order to illustrate 
some general principles of ‘molecular engineering’, we 
will first consider polyethylene (PE), a linear thermo- 
plastic which can be readily shaped by a combination 
of heat and pressure. Its basic repeat unit of struc- 
ture (mer) is derived from the ethene, or ethylene, 2 
molecule C 2 H 4 and has a relative mer mass M mon of 
28, i.e. (12 x 2) + (1 x 4). This monomer has two free 
bonds and is said to be unsaturated and bifunctional; 
these mers can link up endwise to form a long-chain 
molecule (C 2 H 4 ) n , where n is the degree of polymer- 
ization or number of repeat units per chain. Thus the 
relative mass 3 * of a chain molecule is M = nM mon . 
The resultant chain has a strong spine of covalently- 
bonded carbon atoms that are arranged in a three- 
dimensional zigzag form because of their tetrahedral 
bonding. Polyethylene in bulk can be visualized as a 
tangled mass of very large numbers of individual chain 
molecules. Each molecule may contain thousands of 
mers, typically 10 3 to 10 5 . The carbon atoms act rather 
like ‘universal joints’ and allow it to flex and twist. 

The mass and shape of these linear molecules 
have a profound effect upon the physical, mechani- 
cal and chemical properties of the bulk polymer. As 
the length of molecules increases, the melting points, 
strength, viscosity and chemical insolubility also tend 
to increase. For the idealized and rare case of a 

2 The double bond of ethene 


C = C 
I I 

H H 

is essential for polymerization; it is opened up by heat, 
light, pressure and/or catalysts to form a reactive 
bifunctional monomer 


— C — C — 

I I 

H H 

3 It is common practice to use relative molecular masses or 
‘molecular weights’. Strictly speaking, one should use 
molar masses; that is, amounts of substance containing as 
many elementary entities (molecules, mers), as there are 
atoms in 0.012 kg of the carbon isotope 6C 12 . 
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Figure 2.23 The molecular mass distribution of a 
polyethylene, determined using GPC ( from Mills, 1986; by 
permission of Edward Arnold). 

monodisperse polymer, all the chain molecules are of 
equal length and M is constant. However, in practice, 
polymers are usually polydisperse with a statistical dis- 
tribution of chain lengths (Figure 2.23). The average 
molecular mass M and the ‘spread’ in values between 
short and long chains are important quantitative indi- 
cators of behaviour during processing. 

A polymer sample may be regarded as a collection 
of fractions, or sub-ranges, of molecular size, with 
each fraction having a certain mid-value of molecular 
mass. Let us suppose that the ith fraction contains 
Ni molecules and that the mid-value of the fraction 
is Mi. Hence the total number of molecules for all 
fractions of the sample is ^ N t . In calculating a single 
numerical value, the average molecular mass M, which 
will characterize the distribution of chain sizes, it 
is necessary to distinguish between number- average 
fractions and mass-average fractions of molecules in 
the sample. Thus, in calculating the number- average 
molecular mass M N of the sample, let the number 
fraction be a t . Then = N t / ^ N t and: 

m n = j2 = X) (fi / N >) Mi 

M n is very sensitive to the presence of low-mass 
molecules; accordingly, it is likely to correlate with 
any property that is sensitive to the presence of short- 
length molecules (e.g. tensile strength). 

In similar fashion, the mass-average molecular mass 
M w can be calculated from mass fractions 0*. Since 
0/ = mi/Yl m i > it follows that: 

m w = <piMi = D (»».•/ D m/ ) Mi 
= '^mMi jffmi 

Using the Avogadro constant N A , we can relate mass 
and number fractions as follows: 

mi /Mi = Ni /N a hence m, = /N A 


Substituting for m t , the previous expression for mass- 
average molecular mass becomes: 

Mw = y>M?/y>M,. 

The full molecular mass distribution, showing its 
‘spread’, any skewness, as well as the two average 
values M w and M N , can be determined by gel per- 
meation chromatography (GPC). This indirect method 
is calibrated with data obtained from direct physical 
measurements on solutions of polymers (e.g. osmom- 
etry, light scattering, etc.). For the routine control of 
production processes, faster and less precise methods, 
such as melt flow index (MFI) measurement, are used 
to gauge the average molecular mass. 

M w is particularly sensitive to the long-chain 
molecules and therefore likely to relate to properties 
which are strongly influenced by their presence (e.g. 
viscosity). A/ w always exceeds M N . (In a hypothetical 
monodisperse system, M w = M N .) This inequality 
occurs because a given mass of polymer at one end 
of the distribution contains many short molecules 
whereas, at the other end, the same mass need 
only contain a few molecules. M w is generally 
more informative than M N so far as bulk properties 
are concerned. The ratio M W /M N is known as 
the polydispersivity index; an increase in its value 
indicates an increase in the ‘spread’, or dispersion, 
of the molecular mass distribution (MMD) in a 
polydisperse. In a relatively simple polymer, this ratio 
can be as low as 1.5 or 2 but, as a result of complex 
polymerization processes, it can rise to 50, indicating 
a very broad distribution of molecular size. 

The development of engineering polymers usually 
aims at maximizing molecular mass. For a particular 
polymer, there is a threshold value for the average 
degree of polymerization (n) beyond which properties 
such as strength and toughness develop in a potentially 
useful manner. (Either M w or M N can be used to cal- 
culate n.) It is apparent that very short molecules of 
low mass can slip past each other fairly easily, to the 
detriment of mechanical strength and thermal stability. 
On the other hand, entanglement of chains becomes 
more prevalent as chains lengthen. However, improve- 
ment in properties eventually becomes marginal and 
the inevitable increase in viscosity can make process- 
ing very difficult. Thus, for many practical polymers, n 
values lie in the range 200-2000, roughly correspond- 
ing to molecular masses of 20 000 to 200 000. 

In certain polymer systems it is possible to adjust the 
conditions of polymerization (e.g. pressure, tempera- 
ture, catalyst type) and encourage side reactions at sites 
along the spine of each discrete chain molecule. The 
resultant branches can be short and/or long, even mul- 
tiple. Polyethylene provides an important commercial 
example of this versatility. The original low-density 
form (LDPE) has a high degree of branching, with 
about 15-30 short and long branches per thousand 
carbon atoms, and a density less then 940 kg m -3 . 
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The use of different catalysts permitted lower poly- 
merization pressures and led to the development of 
a high-density form (HDPE) with just a few short 
branches and a density greater than 940 kg m -3 . Being 
more linear and closely-packed than LDPE, HDPE is 
stronger, more rigid and has a melting point (135°C) 
which is 25 °C higher. 

Weak forces exist between adjacent chain molecules 
in polyethylene. Heating, followed by the application 

Table 2.6 Repeat units of typical thermoplastics 


of pressure, causes the molecules to straighten and 
slide past each other easily in a viscous manner. 
Molecular mobility is the outstanding feature of 
thermoplastics and they are well suited to melt- 
extrusion and injection-moulding. These processes 
tend to align the chain molecules parallel to the 
direction of shear, producing a pronounced preferred 
orientation (anisotropy) in the final product. If the 
polymer is branched, rather than simply linear, 


Linear form 
(HDPE) 


Polyethylene 

(PE) 


Branched form 
(LDPE) 


Polyvinyl chloride 
(PVC) 
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branches on adjacent chains will hook on to each 
other and reduce their relative mobility. This effect 
underlines the fundamental importance of molecular 
shape. 

In the important vinyl family of thermoplastics, one 
of the four hydrogen atoms in the C 2 H 4 monomer of 
polyethylene is replaced by either a single atom (chlo- 
rine) or a group of atoms, such as the methyl radical 
CH 3 , the aromatic benzene ring and the acetate 
radical C 2 H 3 0 2 . These four polymers, polyvinyl chlo- 
ride (PVC), polypropylene (PP), polystyrene (PS) and 
polyvinyl acetate (PVAc), are illustrated in Table 2.6. 

Introduction of a different atom or group alongside 
the spine of the molecule makes certain alternative 
symmetries possible. For instance, when all the chlo- 
ride atoms of the PVC molecule lie along the same side 
of each chain, the polymer is said to be isotactic. In the 
syndiotactic form, chlorine atoms are disposed sym- 
metrically around and along the spine of the molecule. 
A fully-randomized arrangement of chlorine atoms is 
known as the atactic form. Like side-branching, tac- 
ticity can greatly influence molecular mobility. During 
addition polymerization, it is possible for two or more 
polymers to compete simultaneously during the join- 
ing of mers and thereby form a copolymer with its 
own unique properties. Thus, a vinyl copolymer is pro- 
duced by combining mers of vinyl chloride and vinyl 
acetate in a random sequence. 1 In some copolymers, 
each type of constituent mer may form alternate blocks 
of considerable length within the copoly meric chains. 
Branching can, of course, occur in copolymers as well 
in ‘straight’ polymers. 

2.7.2 Elastomers 

The development of a relatively small number of 
crosslinking chains between linear molecules can pro- 
duce an elastomeric material which, according to an 
ASTM definition, can be stretched repeatedly at room 
temperature to at least twice its original length and 
which will, upon sudden release of the stress, return 
forcibly to its approximate original length. As shown in 
Figure 2.24, the constituent molecules are in a coiled 
and kinked condition when unstressed; during elastic 
strain, they rapidly uncoil. Segments of the structure 
are locally mobile but the crosslinks tend to prevent 
any gross relative movement of adjoining molecules, 
i.e. viscous deformation. However, under certain con- 
ditions it is possible for elastomers, like most poly- 
mers, to behave in a viscoelastic manner when stressed 
and to exhibit both viscous (time-dependent) and elas- 
tic (instantaneous) strain characteristics. These two 
effects can be broadly attributed, respectively, to the 

1 In the late 1940s this copolymer was chosen to provide the 
superior surface texture and durability required for the first 
long-play microgroove gramophone records. This 
33^ r.p.m. system, which quickly superseded 78 r.p.m. 
shellac records, has been replaced by compact discs made 
from polycarbonate thermoplastics of very high purity. 



Figure 2.24 Unstrained elastomeric structure showing 
entanglement, branching points, crosslinks, loops and free 
ends ( after Young, 1991). 


relative movement and the uncoiling and/or unravel- 
ling of molecular segments. 

Elastomers include natural polymers, such as poly- 
isoprene and polybutadiene in natural rubbers, and 
synthetic polymers, such as polychloroprene ( Neo- 
prene), styrene -butadiene rubber (SBR) and silicone 
rubbers. The structural repeat units of some impor- 
tant elastomers are shown in Table 2.7. In the orig- 
inal vulcanization process, which was discovered by 
C. Goodyear in 1839 after much experimentation, iso- 
prene was heated with a small amount of sulphur to 
a temperature of 140°C, causing primary bonds or 
crosslinks to form between adjacent chain molecules. 
Individual crosslinks take the form C-(S) n -C, where 
n is equal to or greater than unity. Monosulphide links 
(n = 1) are preferred because they are less likely to 
break than longer links. They are also less likely to 
allow slow deformation under stress (creep). Examples 
of the potentially-reactive double bonds that open up 
and act as a branching points for crosslinking are 
shown in Table 2.7. Nowadays, the term vulcaniza- 
tion is applied to any crosslinking or curing process 
which improves elasticity and strength; it does not 
necessarily involve the use of sulphur. Hard rubber 
( Ebonite ) contains 30-50% sulphur and is accordingly 
heavily crosslinked and no longer elastomeric. Its long- 
established use for electrical storage battery cases is 
now being challenged by polypropylene (PP). 

The majority of polymers exhibit a structural change 
known as the glass transition point, T g ; this tem- 
perature value is specific to each polymer and is of 
great practical and scientific significance. (Its impli- 
cations will be discussed more fully in Chapter 11.) 
In general terms, it marks a transition from hard, stiff 
and brittle behaviour (comparable to that of an inor- 
ganic glass) to soft, rubbery behaviour as the tempera- 
ture increases. The previously-given ASTM definition 
described mechanical behaviour at room temperature; 
it follows that the elastomeric condition refers to tem- 
peratures well above T g . Table 2.7 shows that typi- 
cal values for most elastomers lie in the range —50° 
to — 80°C. When an elastomeric structure is heated 
through T g , the segments between the linkage or 
branching points are able to vibrate more vigorously. 
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Table 2.7 Repeat units of typical elastomers 


Polyisoprene 


Polychloroprene 


Polybutadiene 


H H 

I I 

Styrene-butadiene C C 

rubber (SBR) I ! 

C fi H 5 H 
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-50°C 



-120°C 


A simple linear equation expresses the temperature- 
dependence of an elastomer’s response to shear stress: 

/i=NkT 

where /x is the shear modulus, N is the number 
of segments per unit volume of structure (between 
successive points of crosslinking), k is the Boltzmann 
constant and T is absolute temperature. Segments are 
typically about 100 repeat units long. Clearly, for a 
given polymer, the stiffness under shear conditions is 
directly proportional to the absolute temperature. As 
temperature increases, deflection under load becomes 
less. This rather unusual feature has raised engineering 
problems in suspension systems. 

At higher temperatures, well above T g , the poly- 
meric structure is likely to deform slowly under 
applied stress (creep) and ultimately to break down 
into smaller chemical entities, or degrade. As indicated 
in Figure 2.24, unstressed elastomers are disordered 
and non-crystalline. Interestingly, stressing to produce 
a high elastic strain, say 200% or more, will induce 
a significant amount of crystallinity. Stress aligns the 
chains and produces regions in which repeat units 
form ordered patterns. (This effect is readily demon- 
strated by projecting a monochromatic beam of X-rays 
through relaxed and stretched membranes of an elas- 
tomer and comparing the diffraction patterns formed 
upon photographic film.) 


2.7.3 Thermosets 

In the third and remaining category of polymers, 
known generally as thermo sets or network polymers, 
the degree of crosslinking is highly developed. As a 
result, these structures contain many branching points. 
They are rigid and strong, being infinitely braced 
in three dimensions by numerous chain segments of 
relatively short length. Unlike thermoplastics, molec- 
ular mobility is virtually absent and T g is accord- 
ingly high, usually being above 50°C. Thermosets are 
therefore regarded generally as being hard and brittle 
(‘glassy’) materials. Examples of thermoset resins in 
common use include phenol-formaldehyde (P-F resin; 
Bakelite), epoxy resins (structural adhesives; Araldite), 
urea formaldehyde (U-F resin; Beetle ) and polyester 
resins. A resin is a partially-polymerized substance 
which requires further treatment. 

The utilization of thermosets typically involves two 
stages of chemical reaction. In the first stage, a liq- 
uid or solid prepolymer form or precursor is produced 
which is physically suitable for casting or moulding. 
Resins are well-known examples of this intermedi- 
ate state. Their structures consist mostly of linear 
molecules and they are potentially reactive, having a 
specifiable shelf-life. In the second stage, extensive 
crosslinking is promoted by heating, pressure applica- 
tion or addition of a hardening agent, depending upon 
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the type of polymer. This stage is commonly referred 
to as curing. The resultant random network possesses 
the desired stiffness and strength. When heated, this 
structure does not exhibit viscoelastic flow and, being 
both chemically and physically stable, remains unal- 
tered and hard until the decomposition temperature 
is reached. The formation of a thermoset may thus 
be regarded as an irreversible process. With pheno- 
lic resins, final crosslinking is induced by heating, as 
implied by the term thermoset. The latter term is also 
applied, in a looser sense, to polymers in which final 
crosslinking occurs as the result of adding a hard- 
ener; for example, in epoxy resin adhesives and in 
the polyester resin-based matrix of glass-reinforced 
polymer (GRP; Fibreglass). In these substances, an 
increase in the ratio of hardener to resin tends to 
increase the T g value and the modulus. 

Many thermoset structures develop by condensa- 
tion polymerization. This process is quite different in 


chemical character to the process of addition polymer- 
ization by which linear molecules grow in an endwise 
manner in thermoplastics. Although it is one of the 
oldest synthetic polymers, having appeared as Bake- 
lite in the early 1900s, phenolformaldehyde retains its 
industrial importance. It is widely used for injection- 
mouldings in the automotive and electrical industries, 
for surface coatings and as a binder for moulding 
sands in metal foundries. It is therefore appropriate to 
use phenolformaldehyde as an illustrative example of 
the condensation reaction, focusing upon the novolac 
resin route. In the first stage (Figure 2.25a), phenol 
and formaldehyde groups react to form an addition 
compound (methylol derivative). Figure 2.25b shows 
these derivatives joining with phenol groups in a con- 
densation reaction. Methylene bridges (CH 2 ) begin to 
form between adjacent phenol groups and molecules 
of water are released. The two reactions shown dia- 
grammatically in Figures 2.25a and 2.25b produce a 




Figure 2.25 Interaction of phenol and formaldehyde to form a thermoset structure. 
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relatively unreactive novolac resin. (Control of the ini- 
tial phenol/formaldehyde ratio above unity ensures that 
a deficiency of formaldehyde will inhibit crosslinking 
during this first stage.) After drying, grinding and the 
addition of fillers and colourants, the partly-condensed 
resin is treated with a catalysed curing agent which 
acts as a source of formaldehyde. Network formation 
then proceeds during hot-moulding at a temperature 
of 200-300°C. Each phenol group is said to be tri- 
functional because it can contribute three links to the 
three-dimensional random network (Figure 2.25c). 

Another type of phenolic resin, the resole, is pro- 
duced by using an initial phenol/formaldehyde ratio 
of less than unity and a different catalyst. Because of 
the excess of formaldehyde groups, it is then possible 
to form the network structure by heating without the 
addition of a curing agent. 

2.7.4 Crystallinity in polymers 

So far, we have regarded thermoplastic polymers as 
essentially disordered (non-crystalline) structures in 
which chain molecules of various lengths form a tan- 
gled mass. This image is quite appropriate for some 
polymers e.g. polystyrene and polymethyl methacry- 
late (Perspex). However, as indicated in the case of 
stressed elastomers (Section 2.7.2), it is possible for 
chain molecules to form regions in which repeat units 
are aligned in close-packed, ordered arrays. Crystalline 
regions in polymers are generally lamellar in form and 
often small, with their smallest dimension in the order 
of 10-20 nm. Inevitably, because of the complexity of 
the molecules, crystallized regions are associated with 
amorphous regions and defects. However, the degree 
of crystallinity attainable can approach 80-85% by 
volume of the structure. Thus, in polyethylene (PE), 
a simple ‘linear’ thermoplastic that has been the sub- 
ject of much investigation, crystalline regions nucleate 
and grow extremely rapidly during polymerization, 
their formation being virtually unpreventable. Values 
of 50% and 80%, respectively, are quoted for the crys- 
tallinity of its low- and high-density forms, LDPE and 



HDPE. A medium-density form provides an appropri- 
ate balance of strength and flexibility and has been 
used for the yellow distribution pipes which convey 
natural gas in the UK. Crystalline regions are close- 
packed and act as barriers to the diffusion of gases 
and small molecules. This impermeability favours the 
use of LDPE for food wrappings. HDPE, being even 
more crystalline, has a lower permeability to gases and 
vapours than LDPE. 

A typical chain molecule contains hundreds, often 
thousands, of single primary bonds along its zigzag- 
shaped backbone. Rotation about these bonds enables 
the molecule to bend, fold, coil and kink. Under cer- 
tain circumstances, it is possible for it to adopt an 
overall shape, or conformation, and then interlock 
in a close-packed manner alongside other molecules 
(or even itself) to form a three-dimensional crys- 
talline region. Figure 2.26 shows a small portion of 
a crystalline region in polyethylene. Segments of five 
chain molecules, composed of CH 2 groups, are packed 
together and said to be in extended conformation. (For 
convenience, the carbon and hydrogen atoms have 
been ‘shrunk’.) The orthorhombic unit cell, 1 with its 
unequal axes, has been superimposed. 

The criteria which determine the extent to which a 
polymer can crystallize are (1) the conditions under 
which the polymer forms or is processed, and (2) the 
structural character of its molecules. With regard to the 
conditions, crystallization is favoured by slow cooling 
rates. Industrially, rapid cooling usually prevails and 
there is relatively little time available for molecular 
packing and ordering. Application of stress can be used 
to induce crystallization either during or after poly- 
merization. As the degree of crystallization increases, 
the polymer becomes denser; it also becomes more 
resistant to thermal degradation. This important fea- 
ture is reflected in the crystalline melting point T m . 

] PE can also exist in a less stable monoclinic form. Cubic 
forms, with their high symmetry, do not appear in 
polymeric systems; consequently, crystalline polymers 
frequently exhibit pronounced anisotropy. 


(b) 



Figure 2.26 Crystal structure of orthorhombic polyethylene, (a) General view of unit cell, (b) Projection of unit cell parallel to 
the chain direction, c. • carbon atoms, O hydrogen atoms, a = 0.741 nm, b = 0.494 nm, c = 0 .255 nm (from Young, 1991). 
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Figure 2.27 Specific volume versus temperature plots for (a) 100% amorphous polymer, (b) partially-crystalline polymer, 
(c) 100% crystalline polymer. 


At this temperature, which is higher in value than T g , 
the crystalline components of the structure break down 
completely on heating (Figure 2.27). For a given poly- 
mer, T m increases with the degree of crystallinity; for 
example, its values for LDPE and HDPE are 110°C 
and 135°C, respectively. Holding a polymer at a tem- 
perature between T g and T m (annealing) is sometimes 
used as a method for increasing existing crystallinity. 

Turning now to the matter of structural complex- 
ity, crystallization is less likely as molecules become 
longer and more complex. Thus the presence of 
side-branching is a steric hindrance to crystallization, 
particularly if the molecules are atactic in character. 
Iso- and syndio-tacticity are more readily accommo- 
dated; for example, isotactic polypropylene (iPP) is 
a material with useful engineering strength whereas 
atactic PP is a sticky gum. Scrutiny of the various 
repeat units shown earlier in Table 2.6 shows that they 
are usually asymmetrical and may be said to possess 
a ‘head’ and a ‘tail’. Various combinations, or con- 
figurations, can result from addition polymerization, 
such as ‘head to tail’ (XYXYXYXY. . .) or ‘head to 
head’ (XYYXXYYXX. . .), etc. If the molecules have 
a similar and consistent configuration, crystallization is 
favoured. For instance, the previously-mentioned poly- 
mer, isotactic PP, has a ‘head to tail’ configuration 
throughout and can develop a high degree of crys- 
tallinity. Matching configurations favour crystallinity. 
Crystallinity is therefore more likely in copolymers 
with regular block patterns of constituents than in ran- 
dom copolymers. 

Summarizing, regular conformations and/or regular 
configurations favour crystallization. Each of these two 
characteristics is manipulated in a different way. Con- 
formations are changed by physical means (annealing, 
application of stress): changes in configuration require 
the breaking of bonds and are achieved by chemical 
means. 


The fine structure of crystalline regions in commer- 
cial polymers and their relation to associated amor- 
phous regions have been the subject of much research. 
An important advance was made in the 1950s when 
single crystals of polyethylene were produced for 
the first time. A typical method of preparation is 
to dissolve <0.01% PE in xylene at a temperature 
of 135-138°C and then cool slowly to 70-80°C. 
The small PE crystals that precipitate are several 
microns across and only 10-20 nm thick. The thick- 
ness of these platey crystals (lamellae) is temperature- 
dependent. Diffraction studies by transmission electron 
microscope showed that, surprisingly, the axes of the 
chain molecules were approximately perpendicular to 
the two large faces of each lamella. In view of the 
smallness of the crystal thickness relative to the aver- 
age length of molecules, it was deduced that multiple 
chain-folding had occurred during crystallization from 
the mother liquor. In other words, a molecule could 
exhibit an extended and/or folded conformation. The 
chain-folding model has been disputed but is now 
generally accepted. The exact nature of the fold sur- 
face has also been the subject of much debate; typical 
models for folding are shown in Figure 2.28. The mea- 
sured density of these single crystals is less than the 
theoretical value; this feature indicates that irregular 
arrangements exist at fold surfaces and that the crystal 
itself contains defects. Most of the folds or loops are 
tight and each requires only two or three repeat units of 
molecular structure. ‘Loose’ loops of larger radius and 
chain ends (cilia) project from the surfaces of lamel- 
lae. These fundamental studies on single crystals had 
a far-reaching effect upon polymer physics, compara- 
ble in importance to that of single metal crystals upon 
metallurgical science. 

In contrast to the relatively uncongested conditions 
that exist at the surface of an isolated crystal 
growing from a dilute solution, entanglement of 
chain molecules is more likely when a polymer 
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Figure 2.28 Folded chain model for crystallinity in polymers shown in (a) two dimensions and (b) three dimensions ( after 
Askeland, 1990, p. 534; by permission of Chapman and Hall, UK and PWS Publishers, USA). 



Figure 2.29 Polarized light micrograph of two-dimensional 
spherulites grown in a thin film of polyethylene oxide ( from 
Mills, 1986; by permission of Edward Arnold). 

crystallizes from the molten state. Consequently, melt- 
grown crystallites are more complex in physical 
character. Microscopical examination of thin sections 
of certain crystallizable polymers (PE, PS or 
nylon) can reveal a visually-striking spherulitic state 
of crystalline aggregation. In Figure 2.29, three- 
dimensional spherulites have grown in a radial 
manner from a number of nucleating points scattered 
throughout the melt. Nucleation can occur if a few 
molecular segments chance to order locally at a point 
(homogeneous nucleation). However, it is more likely 
that nucleation is heterogeneous, being initiated by the 
presence of particles of foreign matter or deliberately- 
added nucleating agents. Radial growth continues until 
spherulites impinge upon each other. Spherulites are 
much larger than the isolated single crystals previously 


described and range in diameter from microns to 
millimetres, depending upon conditions of growth. 
Thus crystallization at a temperature just below T m 
will proceed from relatively few nucleating points and 
will ultimately produce a coarse spherulitic structure. 
However, this prolonged ‘annealing’, or very slow 
cooling from the molten state, can produce cracks 
between the spherulites (over-crystallization). 

Internally, spherulites consist of lamellae. During 
crystallization, the lamellae grow radially from the 
nucleus. As with solution-grown single crystals, these 
lamellae develop by chain-folding and are about 10 nm 
thick. Space-filling lamellar branches also form. Fre- 
quently, a chain molecule extends within one lamella 
and then leaves to enter another. The resultant inter- 
lamellar ties or links have an important role during 
deformation, as will be discussed later. Inevitably, the 
outward growth of lamellae traps amorphous material. 
Spherulites are about 70-80% crystalline if the con- 
stituent molecules are simple. The layered mixture of 
strong lamellae and weaker amorphous material is rem- 
iniscent of pearlite in steel and, as such, is sometimes 
regarded as a self-assembled (in situ) composite. 

The distinctive patterns seen when spherulitic aggre- 
gates are examined between crossed polars in a light 
microscope (Figure 2.29) provide evidence that the 
lamellae radiating from the nucleus twist in synchro- 
nism. For orthorhombic PE, the c-axes of lamellae lie 
parallel to the length of the extended chain molecules 
and are tangential to the spherulite (Figure 2.30). The 
a- or b - axes are radial in direction. Famellae twist as 
they grow and the c-axes remain normal to the growth 
direction. The refractive index gradually changes for 
each lamella, causing incident plane-polarized light 
to become elliptically polarized in four quadrants of 
each spherulite and to form the characteristic ‘Mal- 
tese cross’ figure. The grain boundary structures of 
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Figure 2.30 Schematic representation of a possible model 
for twisted lamellae in spherulitic polyethylene showing 
chain-folds and intercrystalline links (from Young, 1991). 


poly crystalline metals and alloys are reminiscent of 
spherulitic structures in polymers. Indeed, control of 
spherulite size in partially-crystallized polymers and 
of grain (crystal) size in fully-crystallized metals and 
alloys are well-known means of manipulating strength 
and deformability. 

The mechanism of molecular movement in a poly- 
meric melt has presented a puzzling scientific problem. 
In particular, it was not known exactly how a molecule 
is able to move among entangled molecules and to par- 
ticipate in the progressive chain-folding action that is 
the outstanding feature of crystallization. Clearly, it is 
completely different from atomic and ionic diffusion 
in metals and ceramics. This long-standing problem 



Figure 2.31 Movement of a polymer molecule by reptation. 


was convincingly resolved by de Gennes, 1 who pio- 
neered the idea of reptation, a powerful concept that 
serves to explain various viscous and elastic effects 
in polymers. He proposed that a long-chain molecule, 
acting as an individual, is able to creep lengthwise 
in snake-like movements through the entangled mass 
of molecules. It moves within a constraining ‘reptation 
tube’ (Figure 2.31) which occupies free space between 
molecules; the diameter of this convoluted ‘tube’ is the 
minimum distance between two entangling molecules. 
The reptant motion enables a molecule to shift its cen- 
tre of mass along a ‘tube’ and to progress through a 
tangled polymeric structure. Reptation is more difficult 
for the longer molecules. At the surface of a growing 
crystalline lamella, a molecule can be ‘reeled in from 
its tube’ and become part of the chain-folding process. 
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Chapter 3 

Structural phases: their formation and 
transitions 


3.1 Crystallization from the melt 

3.1.1 Freezing of a pure metal 

At some stage of production the majority of metals 
and alloys are melted and then allowed to solidify as 
a casting. The latter may be an intermediate product, 
such as a large steel ingot suitable for hotworking, 
or a complex final shape, such as an engine cylinder 
block of cast iron or a single-crystal gas-turbine blade 
of superalloy. Solidification conditions determine the 
structure, homogeneity and soundness of cast products 
and the governing scientific principles find application 
over a wide range of fields. For instance, knowledge 
of the solidification process derived from the study 
of conventional metal casting is directly relevant 
to many fusionwelding processes, which may be 
regarded as ‘casting in miniature’, and to the fusion- 
casting of oxide refractories. The liquid/solid transition 
is obviously of great scientific and technological 
importance. 

First, in order to illustrate some basic principles, 
we will consider the freezing behaviour of a melt of 
like metal atoms. The thermal history of a slowly 
cooling metal is depicted in Figure 3.1; the plateau 
on the curve indicates the melting point (m.p.), which 
is pressure-dependent and specific to the metal. Its 
value relates to the bond strength of the metal. Thus, 
the drive to develop strong alloys for service at high 
temperatures has stimulated research into new and 
improved ways of casting high-m.p. alloys based upon 
iron, nickel or cobalt. 

The transition from a highly-disordered liquid to an 
ordered solid is accompanied by a lowering in the 
energy state of the metal and the release of thermal 
energy (latent heat of solidification), forming the arrest 
on the cooling curve shown in Figure 3.1. This order- 
ing has a marked and immediate effect upon other 
structure-sensitive properties of the metal; for instance, 
the volume typically decreases by 1-6%, the electrical 



Figure 3.1 Cooling curve for a pure metal showing possible 
undercooling. 

conductivity rises and the diffusivity, or ability of the 
atoms to migrate, falls. 

Solidification is a classic example of a nucleation 
and growth process. In the general case of freezing 
within the bulk of pure molten metal, minute crys- 
talline nuclei form independently at random points. 
After this homogeneous form of nucleation, contin- 
ued removal of thermal energy from the system causes 
these small crystalline regions to grow independently 
at the expense of the surrounding melt. Throughout 
the freezing process, there is a tendency for bombard- 
ment by melt atoms to destroy embryonic crystals; 
only nuclei which exceed a critical size are able to 
survive. Rapid cooling of a pure molten metal reduces 
the time available for nuclei formation and delays the 
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onset of freezing by a temperature interval of AT. 
This thermal undercooling (or supercooling), which 
is depicted in Figure 3.1, varies in extent, depending 
upon the metal and conditions, but can be as much as 
0.1 -0.3 T m , where T m is the absolute melting point. 
However, commercial melts usually contain suspended 
insoluble particles of foreign matter (e.g. from the 
refractory crucible or hearth) which act as seeding 
nuclei for so-called heterogeneous nucleation. Under- 
cooling is much less likely under these conditions; in 
fact, very pronounced undercooling is only obtainable 
when the melt is very pure and extremely small in 
volume. Homogeneous nucleation is not encountered 
in normal foundry practice. 

The growing crystals steadily consume the melt and 
eventually impinge upon each other to form a struc- 
ture of equiaxed (equal-sized) grains (Figures 3.2 and 
3.3). Heterogeneous nucleation, by providing a larger 
population of nuclei, produces a smaller final grain 
size than homogeneous nucleation. The resultant grain 
(crystal) boundaries are several atomic diameters wide. 
The angle of misorientation between adjacent grains 
is usually greater than 10-15°. Because of this mis- 
fit, such high-angle grain boundaries have a higher 
energy content than the bulk grains, and, on reheating, 
will tend to melt first. (During a grain-contrast etch 
of diamond-polished polycrystalline metal, the etchant 
attacks grain boundaries preferentially by an electro- 
chemical process, producing a broad ‘canyon’ which 
scatters vertically incident light during normal micro- 
scopical examination. The boundary then appears as a 
black line.) 

During the freezing of many metals (and alloys), 
nucleated crystals grow preferentially in certain direc- 
tions, causing each growing crystal to assume a distinc- 
tive, non-faceted 1 tree-like form, known as a dendrite 
(Figure 3.2). In cubic crystals, the preferred axes of 
growth are (100) directions. As each dendritic spike 
grows, latent heat is transferred into the surrounding 
liquid, preventing the formation of other spikes in its 
immediate vicinity. The spacing of primary dendrites 
and of dendritic arms therefore tends to be regular. 
Ultimately, as the various crystals impinge upon each 
other, it is necessary for the interstices of the dendrites 
to be well-fed with melt if interdendritic shrinkage 
cavities are to be prevented from forming. Convection 
currents within the cooling melt are liable to disturb the 
delicate dendritic branches and produce slight angu- 
lar misalignments in the final solidified structure (e.g. 
5-10°). These low-angle boundaries form a lineage 
(macromosaic) structure within the final grain, each 
surface of misfit being equivalent to an array of edge 
dislocations (Chapter 4). Convection currents can also 

x Many metals and a few organic materials grow with 
non-faceted dendritic morphology, e.g. transparent 
succinonitrile-6% camphor has proved a valuable means of 
simulating dendrite growth on a hot-stage optical 
microscope. Most non-metals grow with a faceted 
morphology. 



Figure 3.2 Schematic diagram of three dendrites 
interlocking. 



Figure 3.3 Formation of grains from dendrites of 
Figure 3.2. 


provide thermal pulses which cause dendritic branch 
tips to melt off and enter the main body of the melt 
where they act as ‘kindred nuclei’. Gentle stirring of 
the melt encourages this process, which is known as 
dendrite multiplication, and can be used to produce a 
fine-grained and equiaxed structure (e.g. electromag- 
netic stirring of molten steel). Dendrite multiplication 
is now recognised as an important source of crystals 
in castings and ingots. 

3.1.2 Plane-front and dendritic solidification 
at a cooled surface 

The previous section describes random, multidirec- 
tional crystallization within a cooling volume of pure 
molten metal. In practice, freezing often commences at 
the plane surface of a mould under more complex and 
constrained conditions, with crystals growing counter 
to the general direction of heat flow. The morphology 
of the interface, as well as the final grain structure of 
the casting, are then decided by thermal conditions at 
the solid/liquid interface. 
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Figure 3.4 Plane-front solidification (a) and dendritic solidification (b) of a pure metal, as determined by thermal conditions. 


Figure 3.4a illustrates the case where all the latent 
heat evolved at the interface flows into the solid and 
the temperature gradients in solid and liquid, G s and 
G l , are positive. The solidification front, which moves 
at a velocity R , is stable, isothermal and planar. Any 
solid protuberance which chances to form on this front 
will project into increasingly hotter, superheated liquid 
and will therefore quickly dissolve and be absorbed by 
the advancing front. Planar-front solidification is char- 
acterized by a high G L /R ratio (e.g. slow cooling). If 
the solid is polycrystalline, emerging grain boundaries 
will form grooves in the stable planar front. 

In the alternative scenario (Figure 3.4b), for which 
G l /R has relatively low values, latent heat flows into 
both solid and liquid and G L becomes negative. A 
planar interface becomes unstable. Dendritic protu- 
berances (spikes) grow rapidly into the undercooled 
liquid, which quickly absorbs their evolved latent heat. 
Thermal undercooling is thus an essential prerequi- 
site for dendritic growth; this form of growth becomes 
more and more likely as the degree of thermal under- 
cooling increases. Melts almost invariably undercool 
slightly before solidification so that dendritic mor- 
phologies are very common. (The ability of dilute 
alloy melts to produce a cellular morphology as a 
result of constitutional undercooling will be described 
in Section 3.2.4.3.) 

3.1.3 Forms of cast structure 

Because of the interplay of a variety of physical and 
chemical factors during freezing, the as-cast grain 
structure is usually not as uniform and straightforward 


as those discussed in the previous two sections. 
When solidification commences at the flat surface of 
a metallic ingot mould there is usually an extreme 
undercooling or chilling action which leads to the 
heterogeneous nucleation of a thin layer of small, 
randomly-oriented chill crystals (Figure 3.5). The size 
of these equiaxed crystals is strongly influenced by 
the texture of the mould surface. As the thickness of 
the zone of chill crystals increases, the temperature 
gradient G L becomes less steep and the rate of cooling 
decreases. Crystal growth rather than the nucleation 
of new crystals now predominates and, in many 
metals and alloys, certain favourably-oriented crystals 
at the solid/liquid interface begin to grow into the 
melt. As in the case of the previously-described 



Figure 3.5 Chill-cast ingot structure. 
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dendrites, the rapid growth directions are (100) for 
fee and bee crystals and lie along the direction of 
heat flow. Sideways growth is progressively hindered 
so that the crystals develop a preferred orientation 
and a characteristic columnar form. They therefore 
introduce directionality into the bulk structure; this 
effect will be most pronounced if the metal itself 
is strongly anisotropic (e.g. eph zinc). The preferred 
growth directions for eph crystals are (lOlO). The 
growth form of the interface between the columnar 
crystals and the liquid varies from planar to dendritic, 
depending upon the particular metal (or alloy) and 
thermal conditions. 

As the columnar zone thickens, the temperatures 
within the liquid become more shallow, undercooling 
more prominent and the presence of kindred nuclei 
from dendritic multiplication more likely. Under these 
conditions, independent nucleation (Section 3.1.1) is 
favoured and a central zone of equiaxed, randomly- 
oriented crystals can develop (Figure 3.5). Other fac- 
tors such as a low pouring temperature (low superheat), 
moulds of low thermal conductivity and the presence 
of alloying elements also favour the development of 
this equiaxed zone. There is a related size effect, with 
the tendency for columnar crystals to form decreasing 
as the cross-section of the mould cavity decreases. 
However, in the absence of these influences, growth 
predominates over nucleation, and columnar zone may 
extend to the centre of the ingot (e.g. pure metals). 
The balance between the relative proportions of outer 
columnar crystals and inner equiaxed crystals is impor- 
tant and demands careful control. For some purposes, 
a completely fine-grained stucture is preferred, being 
stronger and more ductile. Furthermore, it will not 
contain the planes of weakness, shown in Figure 3.5, 
which form when columnar crystals impinge upon each 
other obliquely. (In certain specialized alloys, how- 
ever, such as those for high-power magnets and creep- 
resistant alloys, a coarse grain size is prescribed.) 

The addition of various ‘foreign’ nucleating agents, 
known as inoculants, is a common and effective 
method for providing centres for heterogeneous nucle- 
ation within the melt, inhibiting undercooling and 
producing a uniform fine-grained structure. Refining 
the grain structure disperses impurity elements over a 
greater area of grain boundary surface and generally 
benefits mechanical and founding properties (e.g. duc- 
tility, resistance to hot-tearing). However, the need for 
grain refinement during casting operations is often less 
crucial if the cast structure can be subsequently worked 
and/or heat-treated. Nucleating agents must remain 
finely dispersed, must survive and must be wetted by 
the superheated liquid. Examples of inoculants are tita- 
nium and/or boron (for aluminium alloys), zirconium 
or rare earth metals (for magnesium alloys) and alu- 
minium (for steel). Zirconium is an extremely effective 
grain-refiner for magnesium and its alloys. The close 
similarity in lattice parameters between zirconium and 
magnesium suggests that the oriented overgrowth (epi- 
taxy) of magnesium upon zirconium is an important 


factor; however, inoculants have largely been devel- 
oped empirically. 

3.1.4 Gas porosity and segregation 

So far we have tended to concentrate upon the 
behaviour of pure metals. It is now appropriate to 
consider the general behaviour of dissimilar types of 
atoms which, broadly speaking, fall into two main 
categories: those that have been deliberately added for 
a specific purpose (i.e. alloying) and those that are 
accidentally present as undesirable impurities. Most 
metallic melts, when exposed to a furnace atmosphere, 
will readily absorb gases (e.g. oxygen, nitrogen, 
hydrogen). The solubility of gas in liquid metal can 
be expressed by Sievert’s relation, which states that 
the concentration of dissolved gas is proportional to 
the square root of the partial pressure of the gas in the 
contacting atmosphere. Thus, for hydrogen, which is 
one of the most troublesome gases: 

[H soM ion]=K{p(H 2 )} 1/2 (3.1) 

The constant K is temperature-dependent. The solu- 
bility of gases decreases during the course of freezing, 
usually quite abruptly, and they are rejected in the form 
of gas bubbles which may become entrapped within 
and between the crystals, forming weakening blow- 
holes. It follows from Sievert’s relation that reducing 
the pressure of the contacting atmosphere will reduce 
the gas content of the melt; this principle is the basis of 
vacuum melting and vacuum degassing. Similarly, the 
passage of numerous bubbles of an inert, low- solubility 
gas through the melt will also favour gas removal (e.g. 
scavenging treatment of molten aluminium with chlo- 
rine). Conversely, freezing under high applied pres- 
sure, as in the die-casting process for light alloys, 
suppresses the precipitation of dissolved gas and pro- 
duces a cast shape of high density. 

Dissolved gas may precipitate as simple gas bubbles 
but may, like oxygen, react with melt constituents to 
form either bubbles of compound gas (e.g. C0 2 , CO, 
S0 2 , H 2 O vap ) or insoluble non-metallic particles. The 
latter are potential inoculants. Although their presence 
may be accidental, as indicated previously, their delib- 
erate formation is sometimes sought. Thus, a specific 
addition of aluminium, an element with a high chem- 
ical affinity for oxygen, is used to deoxidize molten 
steel in the ladle prior to casting; the resultant par- 
ticles of alumina subsequently act as heterogeneous 
nucleants, refining the grain size. 

Segregation almost invariably occurs during solid- 
ification; unfortunately, its complete elimination is 
impossible. Segregation, in its various forms, can seri- 
ously impair the physical, chemical and mechanical 
properties of a cast material. In normal segregation, 
atoms different to those which are crystallizing can 
be rejected into the melt as the solid/liquid interface 
advances. These atoms may be impurities or, as in the 
case of a solid solution alloy, solute atoms. Insolu- 
ble particles can also be pushed ahead of the interface. 
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Eventually, pronounced macro- segregation can be pro- 
duced in the final regions to solidify, particularly if 
the volume of the cast mass is large. On a finer scale, 
micro-segregation can occur interdendritically within 
both equiaxed and columnar grains (coring) and at the 
surfaces of low- and high-angle grain boundaries. The 
modern analytical technique of Auger electron spec- 
troscopy (AES) is capable of detecting monolayers 
of impurity atoms at grain boundary surfaces and has 
made it possible to study their very significant effect 
upon properties such as ductility and corrosion resis- 
tance (Chapter 5). 

In the other main form of separation process, 1 which 
is known as inverse segregation, thermal contraction of 
the solidified outer shell forces a residual melt of low 
melting point outwards along intergranular channels 
until it freezes on the outside of the casting (e.g. ‘tin 
sweat’ on bronzes, ‘phosphide sweat’ on grey cast 
iron). The direction of this remarkable migration thus 
coincides with that of heat flow, in direct contrast 
to normal macro- segregation. Inverse segregation can 
be prevented by unidirectional solidification. Later, in 
Section 3. 2.4.4, it will be shown how the process of 
zone-refining, as used in the production of high-purity 
materials for the electronics industry, takes positive 
advantage of segregation. 

3.1.5 Directional solidification 

The exacting mechanical demands made upon gas tur- 
bine blades have led to the controlled exploitation of 
columnar crystal growth and the development of direc- 
tional solidification (DS) techniques for superalloys. 2 
As the turbine rotor rotates, the hot blades are subject 
to extremely large centrifugal forces and to thermal 
excursions in temperature. Development has proceeded 
in two stages. First, a wholly columnar grain struc- 
ture, without grain boundaries transverse to the major 
axis of the blade, has been produced during preci- 
sion investment casting by initiating solidification at 
a watercooled copper chill plate in the mould base 
and then slowly withdrawing the vertically-positioned 
mould from the hot zone of an enclosing furnace. Most 
of the heat is removed by the chill plate. A restricted 
number of crystals is able to grow parallel to the major 
axis of the blade. Transverse grain boundaries, which 
have been the initiation sites for intergranular creep 
failures in equiaxed blade structures, are virtually elim- 
inated. Grain shape is mainly dependent upon (1) the 
thermal gradient G L extending into the melt from the 
melt/solid interface and (2) the growth rate R at which 
this interface moves. Graphical plots of G L versus R 

1 Like all separation processes, even so-called chemical 
separations, it is essentially a physical process. 

2 Directional solidification of high-temperature alloys was 
pioneered by F. L. VerSnyder and R. W. Guard at the 
General Electric Company, Schenectady, USA, in the late 
1950s; by the late 1960s, DS blades were being used in gas 
turbines of commercial aircraft. 



Figure 3.6 A single-crystal blade embodying a DS-starter 
block and helical constriction and a test plate facilitating 
orientation check by XRD. 

have provided a useful means for predicting the grain 
morphology of DS alloys. 

In the second, logical stage of development, a single- 
crystal (SC) turbine blade is produced by placing a 
geometrical restriction between a chilled starter block 
and the mould proper (Figure 3.6). This spiral selector 
causes several changes in direction and ensures that 
only one of the upward-growing columnar crystals in 
the starter block survives. A typical production unit 
is illustrated in Figure 3.7. The orientation of every 
blade is checked by means of a computerized X-ray 
diffraction procedure (e.g. the SCORPIO system at 
Rolls-Royce). In the fee nickel-based superalloys, the 
favoured [100] growth direction coincides with the 
major axis of the blade and fortunately offers the best 
overall mechanical properties. For instance, the modu- 
lus of elasticity is low in the (100) directions; conse- 
quently, thermal stresses are reduced and the resistance 
to thermal fatigue enhanced. If full three-dimensional 
control of crystal orientation is required, a seed crystal 
is precisely located close to the chill plate and gives 
the desired epitaxial, or oriented, overgrowth. The pro- 
duction of single-crystal turbine blades by directional 
solidification (DS) techniques has increased blade life 
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Figure 3.7 Schematic diagram of directional solidification 
plant for turbine blades. 

substantially and has enabled operating temperatures to 
be raised by 30°C, thus improving engine efficiency. 
It has also had a far-reaching effect upon the philoso- 
phy of alloy design. Previously, certain elements, such 
as carbon, boron, hafnium and zirconium, were added 
in order to strengthen grain boundary surfaces. Unfor- 
tunately, their presence lowers the incipient melting 
point. The DS route dispenses with the need for them 
and permits beneficially higher temperatures to be used 
during subsequent heat-treatment. The DS approach 
requires to be carefully matched to alloy composition; 
it is possible for it to lower the creep strength of certain 
superalloys. 

3.1.6 Production of metallic single crystals 
for research 

Development of highly- specialized industrial-scale 
techniques, such as the directional solidification 
(DS) of turbine blades and the production of 
silicon, germanium and various compounds for 
semiconductors, owes much to expertise gained over 
many years in producing small single crystals for 
fundamental research. Several methods originally 
developed for metals have been adapted for ceramics. 
Experiments with single crystals of metals (and 
ceramics and polymers) have a special place in the 
history of materials science. The two basic methods 
for preparing single crystals involve (1) solidification 
from the melt and (2) grain growth in the solid state. 

In the simplest version of the solidification method, 
the poly crystalline metal to be converted to a single 
crystal is supported in a horizontal, non-reactive boat 
(e.g. graphite) and made to to freeze progressively 
from one end by passing an electric furnace, with 
its peak temperature set about 10°C above the melt- 
ing point, over the boat. Although several nuclei may 


form during initial solidification, the sensitivity of the 
growth rate to orientation usually results in one of the 
crystals swamping the others and eventually forming 
the entire growth front. The method is particularly use- 
ful for seeding crystals of a predetermined orientation. 
A seed crystal is placed adjacent to the poly crystalline 
sample in the boat and the junction is melted before 
commencing the melting/solidification process. Wire 
specimens may be grown in silica or heat-resistant 
glass tubes internally coated with graphite ( Aquadag ). 
In a modern development of these methods, the sam- 
ple is enclosed in an evacuated silica tube and placed 
in a water-cooled copper boat; passage through a high- 
frequency heating coil produces a melt zone. 

Most solidification techniques for single crystals are 
derived from the Bridgman and Czochralski methods. 
In the former, a pure metal sample is loaded in a 
vertical mould of smooth graphite, tapered to a point 
at the bottom end. The mould is lowered slowly down 
a tubular furnace which produces a narrow melt zone. 
The crystal grows from the point of the mould. In 
the Czochralski method, often referred to as ‘crystal- 
pulling’, a seed crystal is withdrawn slowly from the 
surface of a molten metal, enabling the melt to solidify 
with the same orientation as the seed. Rotation of the 
crystal as it is withdrawn produces a cylindrical crystal. 
This technique is used for the preparation, in vacuo , 
of Si and Ge crystals. 

Crystals may also be prepared by a ‘floating-zone’ 
technique (e.g. metals of high melting point such as W, 
Mo and Ta). A pure poly crystalline rod is gripped at 
the top and bottom in water-cooled grips and rotated 
in an inert gas or vacuum. A small melt zone, pro- 
duced by either a water-cooled radio-frequency coil 
or electron bombardment from a circular filament, is 
passed up its length. High purity is possible because 
the specimen has no contact with any source of con- 
tamination and also because there is a zone-refining 
action (Section 3. 2.4.4). 

Methods involving grain growth in the solid state (2) 
depend upon the annealing of deformed samples. In 
the strain-anneal technique, a fine-grained polycrys- 
talline metal is critically strained approximately 1-2% 
elongation in tension and then annealed in a moving- 
gradient furnace with a peak temperature set below 
the melting point or transformation temperature. Light 
straining produces very few nuclei for crystallization; 
during annealing, one favoured nucleus grows more 
rapidly than the other potential nuclei, which it con- 
sumes. The method has been applied to metals and 
alloys of high stacking-fault energy, e.g. Al, silicon- 
iron, (see Chapter 4). Single crystals of metals with 
low stacking-fault energy, such as Au and Ag, are 
difficult to grow because of the ease of formation 
of annealing twins which give multiple orientations. 
Hexagonal metals are also difficult to prepare because 
deformation twins formed during straining act as effec- 
tive nucleation sites. 
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3.2 Principles and applications of 

phase diagrams 

3.2.1 The concept of a phase 

The term ‘phase’ refers to a separate and identifi- 
able state of matter in which a given substance may 
exist. Being applicable to both crystalline and non- 
crystalline materials, its use provides a convenient 
way of expressing a material’s structure. Thus, in 
addition to the three crystalline forms mentioned in 
Section 2.5.1, the element iron may exist as a liquid 
or vapour, giving five phases overall. Similarly, the 
important refractory oxide silica is able to exist as 
three crystalline phases, quartz, tridymite and cristo- 
balite, as well as a non-crystalline phase, silica glass, 
and as molten silica. Under certain conditions, it is pos- 
sible for two or more different phases to co-exist. The 
glass-reinforced polymer (GRP) known as Fibreglass 
is an example of a two-phase structure. 

When referring to a particular phase in the structure 
of a material, we imply a region comprising a large 
number of atoms (or ions or molecules) and the exis- 
tence of a bounding surface which separates it from 
contiguous phases. Local structural disturbances and 
imperfections are disregarded. Thus, a pure metal or 
an oxide solid solution are each described, by conven- 
tion, as single-phase structures even though they may 
contain grain boundaries, concentration gradients (cor- 
ing) and microdefects, such as vacancies, dislocations 
and voids (Chapter 4). Industrial practice understand- 
ably favours relatively rapid cooling rates, frequently 
causing phases to exist in a metastable condition. Some 
form of ‘triggering’ process, such as thermal activa- 
tion, is needed before a metastable phase can adopt 
the stable, or equilibrium, state of lowest energy (e.g. 
annealing of metals and alloys). These two features, 
structural heterogeneity on a micro-scale and non- 
equilibrium, do not give rise to any untoward scientific 
difficulty. 

The production of the thermal- shock resistant known 
as Vycor provides an interesting example of the 
potential of phase control. 1 Although glasses of very 
high silica content are eminently suitable for high- 
temperature applications, their viscosity is very high, 
making fabrication by conventional methods difficult 
and costly. This problem was overcome by taking 
advantage of phase separation in a workable silica 
containing a high proportion of boric oxide. After 
shaping, this glass is heat-treated at a temperature 
of 500- 600° C in order to induce separation of two 
distinct and interpenetrating glassy phases. Electron 
microscopy reveals a wormlike boron-rich phase sur- 
rounded by a porous siliceous matrix. Leaching in hot 
acid dissolves away the former phase, leaving a porous 

1 Vycor, developed and patented by Corning Glass Works 
(USA) just before World War II, has a final composition of 
96Si02-4B203. It has been used for laboratory ware and 
for the outer windows of space vehicles. 


silica-rich structure. Consolidation heat-treatment at a 
temperature of 1000°C ‘shrinks’ this skeletal structure 
by a remarkable 30%. This product has a low linear 
coefficient of thermal expansion ( a = 0.8 x 10 -60 C; 
0-300°C) and can withstand quenching into iced water 
from a temperature of 900° C, its maximum service 
temperature. 

3.2.2 The Phase Rule 

For a given metallic or ceramic material, there is 
a theoretical condition of equilibrium in which each 
constituent phase is in a reference state of lowest 
energy. The independent variables determining this 
energy state, which are manipulated by scientists and 
technologists, are composition, temperature and pres- 
sure. The Phase Rule derived by Willard Gibbs from 
complex thermodynamical theory provides a device 
for testing multi-phase (heterogeneous) equilibria and 
deciding the number of variables (degrees of free- 
dom) necessary to define the energy state of a sys- 
tem. Its basic equation, P + F = C + 2, relates the 
number of phases present at equilibrium (P) and the 
number of degrees of freedom (F) to the number of 
components (C), which is the smallest number of sub- 
stances of independently-variable composition making 
up the system. For metallic systems, the components 
are metallic elements: for ceramics, the components 
are frequently oxides (e.g. MgO, Si0 2 , etc.). 

Consider Figure 3.8, which is a single-component 
(unary) diagram representing the phase relations for a 
typical pure metal. Transitions such as melting, subli- 
mation and vaporization occur as boundaries between 
the three single-phase fields are crossed. Suppose 
that conditions are such that three phases of a pure 
metal co-exist simultaneously, a unique condition rep- 
resented by the triple-point O. Applying the Phase 
Rule: P = 3, C = 1 and F = 0. This system is said 
to be invariant with no degrees of freedom. Having 
stated that ‘three phases co-exist’, the energy state is 
fully defined and it is unnecessary to specify values 
for variables. One or two phases could be caused to 
disappear, but this would require temperature and/or 
pressure to be changed. Similarly, at a melting point 
on the line between single-phase fields, a solid and 
a liquid phase may co-exist (i.e. P = 2, C = 1 and 
F = 1). Specification of one variable (either tempera- 
ture or pressure) will suffice to define the energy state 
of a particular two-phase equilibrium. A point within 
one of the single-phase fields represents a system that 
is bivariant (F = 2). Both temperature and pressure 
must be specified if its state is to be fully defined. 

Let us consider some practical implications of 
Figure 3.8. The line for the solid/liquid transition is 
shown with a slight inclination to the right in this 
schematic diagram because, theoretically, one may 
reason from Le Chatelier’s principle that a pressure 
increase on a typical melt, from Pi to P 2 , will favour 
a constraint-removing contraction and therefore lead 
to freezing. (Bismuth and gallium are exceptions: like 
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Figure 3.8 Phase diagram for a typical pure metal. 


water, they expand on freezing and the solid/liquid 
line has an opposite slope.) The pressure at the critical 
point C, beyond which liquid and vapour are indis- 
tinguishable, is usually extremely high and consider- 
ably greater than one atmosphere. The diagram refers 
to an enclosed system. For instance, if molten metal 
is in a vacuum system, and the pressure is reduced 
isothermally from Pi to P 3 , then vaporization will be 
favoured. 

For most metals, the pressure value at the 
triple point is far below one atmosphere (1 atm = 
101.325 kN m -2 = 1 bar). For copper and lead, it is 
in the order of 10 -6 atm. The pressures associated with 
the solid/vapour line are obviously even lower. Thus, 
for most metals, vapour pressure is disregarded and it 
is customary to use isobaric (constant pressure) phase 
diagrams in which the composition (concentration) of 
component metals is plotted against temperature. As 
a consequence of this practice, metallurgists use a 
modified form of the Phase Rule equation i.e. P + F = 
C + 1. 

Nevertheless, it is possible for vapour pressure to be 
a highly significant factor in alloy systems formed by 
volatile components such as mercury, zinc, cadmium 
and magnesium: in such cases, it is advisable to take 


advantage of full pressure-temperature diagrams. Fur- 
thermore, the partial pressures of certain gases in a 
contacting atmosphere can be highly significant and at 
elevated temperatures can contribute to serious corro- 
sion of metals (e.g. 0 2 , S 2 (Chapter 12)). Again, the 
pressure variable must be taken into account. 

3.2.3 Stability of phases 

3.2.3. 1 The concept of free energy 
Every material in a given state has a characteristic 
heat content or enthalpy, H , and the rate of change of 
heat content with temperature is equal to the specific 
heat of the material measured at constant pressure, 
C P = (dH /dT) p . A knowledge of the quantity H is 
clearly important to understand reactions but it does 
not provide a criterion for equilibrium, nor does it 
determine when a phase change occurs, as shown by 
the occurrence of both exothermic and endothermic 
reactions. To provide this criterion it is necessary to 
consider a second important property of state known 
as the entropy, S. In statistical terms S may be regarded 
as a measure of the state of disorder of the structure, 
but from classical thermodynamics it may be shown 
that for any material passing through a complete cycle 
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of events 



where d Q is the heat exchanged between the system 
and its surroundings during each infinitesimal step and 
T is the temperature at which the transfer takes place. 

It is then convenient to define a quantity S such that 
d S = d Q/T, so that j>dS = 0; entropy so defined is 
then a state property. At constant pressure, d Q = d H 
and consequently 

dS = d Q/T = C p dT/T 
which by integration gives 

T nT 

0 C p /T)dT = So+ / CpSQnT) 

Jo 

where S is the entropy at T K usually measured 
in J K -1 . The integration constant So represents the 
entropy at absolute zero, which for an ordered crys- 
talline substance is taken to be zero; this is often 
quoted as the third law of thermodynamics. Clearly, 
any reaction or transformation within a system will be 
associated with a characteristic entropy change given 
by 

d S = Sp — S a 

where dS is the entropy of transformation and Sp and 
S a are the entropy values of the new phase j$ and 
the old phase a , respectively. It is a consequence of 
this that any irreversible change which takes place in 
a system (e.g. the combustion of a metal) must be 
accompanied by an increase in the total entropy of the 
system. This is commonly known as the second law 
of thermodynamics. 

The quantity entropy could be used for deciding the 
equilibrium state of a system, but it is much more 
convenient to work in terms of energy. Accordingly, 
it is usual to deal with the quantity TS, which has 
the units of energy, rather than just S, and to separate 
the total energy of the system H into two components 
according to the relation 

H = G + TS 

where G is that part of the energy of the system which 
causes the process to occur and TS is the energy 
requirement resulting from the change involved. The 
term G is known as Gibbs’ free energy and is defined 
by the equation 

G = H -TS 

Every material in a given state will have a charac- 
teristic value of G. The change of free energy accom- 
panying a change represents the ‘driving force’ of the 
change, and is given by the expression 

dG = dH - TdS = dE + PdV - TdS 


All spontaneous changes in a system must be accom- 
panied by a reduction of the total free energy of that 
system, and thus for a change to occur the free energy 
change AG must be negative. It also follows that the 
equilibrium condition of a reaction will correspond to 
the state where dG = 0, i.e. zero driving force. 

For solids and liquids at atmospheric pressure the 
volume change accompanying changes of state is very 
small and hence PdV is also very small. It is therefore 
reasonable to neglect this term in the free energy 
equation and use as the criterion of equilibrium dE — 
TdS = 0. This is equivalent to defining the quantity 
(E — TS) to be a minimum in the equilibrium state, 
for by differentiation 

d (E - TS) = d E- TdS - SdT 

= dE — TdS (since T is constant) 

= 0 for the equilibrium state 

The quantity (E — TS) thus defines the equilibrium 
state at constant temperature and volume, and is given 
the symbol F, the Helmholtz free energy (F = E — 
TS), to distinguish it from the Gibbs free energy 
(G = H — TS). In considering changes in the solid 
state it is thus a reasonable approximation to use F in 
place of G. The enthalpy H is the sum of the internal 
and external energies which reduces to H ~ E when 
the external energy PV is neglected. 

3.2. 3.2 Free energy and temperature 

If a metal undergoes a structural change from phase a 
to phase /3 at a temperature T \ then it does so because 
above this temperature the free energy of the /3 phase, 
Gp, becomes lower than the free energy of the a phase, 
G a . For this to occur the free energy curves must vary 
with temperature in the manner shown in Figure 3.9a. 
It can be seen that at T t the free energy of the a-phase 
is equal to that of the /3-phase so that AG is zero; T t 
is, therefore, the equilibrium transformation point. 

Figure 3.9a also shows that successive transforma- 
tions occur in a given temperature range. The way in 
which the absolute value of the free energy of a crystal 
varies with temperature is shown in Figure 3.9b, where 
H and — TS are plotted as a function of temperature. At 
the transformation temperature, T t , the change in heat 
content AH is equal to the latent heat L, while the 
entropy change AS is equal to L/T t . In consequence, 
a plot of G = H — TS shows no sharp discontinuities 
at T t (since AH = T t AS) or T m but merely a discon- 
tinuity of slope. A plot of G versus temperature for 
each of the three phases considered, a , /3 and liquid, 
would then be of the form shown in Figure 3.9(a). In 
any temperature range the phase with the lowest free 
energy is the stable phase. 

3.2. 3. 3 Free energy and composition 

Of considerable importance in metallurgical practice is 
the increase in entropy associated with the formation of 


S = So + [ 
Jo 
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a disordered solid solution from the pure components. 
This arises because over and above the entropies of the 
pure components A and B, the solution of B in A has 
an extra entropy due to the numerous ways in which 
the two kinds of atoms can be arranged amongst each 
other. This entropy of disorder or mixing is of the form 
shown in Figure 3.10a. 

As a measure of the disorder of a given state we 
can, purely from statistics, consider W the number of 
distributions which belong to that state. Thus, if the 
crystal contains N sites, n of which contain A-atoms 
and (N — n) contain B-atoms, it can be shown that 
the total number of ways of distributing the A and B 



Figure 3.10 Variation with composition of entropy (a) and 
free energy (b) for an ideal solid solution, and for non-ideal 
solid solutions (c) and (d). 


atoms on the N sites is given by 
AM 

W = 

n\(N — n)\ 

This is a measure of the extra disorder of solution, 
since W = 1 for the pure state of the crystal because 
there is only one way of distributing N indistinguish- 
able pure A or pure B atoms on the N sites. To ensure 
that the thermodynamic and statistical definitions of 
entropy are in agreement the quantity, W, which is a 
measure of the configurational probability of the sys- 
tem, is not used directly, but in the form 

S = kin W 

where k is Boltzmann’s constant. From this equation it 
can be seen that entropy is a property which measures 
the probability of a configuration, and that the greater 
the probability, the greater is the entropy. Substituting 
for W in the statistical equation of entropy and using 
Stirling’s approximation 1 we obtain 

S = kln[AM/n!(iV-n)!] 

= k[N In N - n In n - (N - n)ln(N - n)\ 

for the entropy of disorder, or mixing. The form of this 
entropy is shown in Figure 3.10a, where c = n/N is 
the atomic concentration of A in the solution. It is of 
particular interest to note the sharp increase in entropy 
for the addition of only a small amount of solute. This 
fact accounts for the difficulty of producing really pure 
metals, since the entropy factor, — TdS, associated with 
impurity addition, usually outweighs the energy term, 
dH , so that the free energy of the material is almost 
certainly lowered by contamination. 

Stirling’s theorem states that if N is large 

In AM = NlnN — N 
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Figure 3.11 Free energy curves showing extent of phase 
fields at a given temperature. 


While Figure 3.10a shows the change in entropy 
with composition the corresponding free energy 
versus composition curve is of the form shown in 
Figure 3.10b, c or d depending on whether the solid 
solution is ideal or deviates from ideal behaviour. 
The variation of enthalpy with composition, or heat 
of mixing, is linear for an ideal solid solution, but 
if A atoms prefer to be in the vicinity of B atoms 
rather than A atoms, and B atoms behave similarly, the 
enthalpy will be lowered by alloying (Figure 3.10c). A 
positive deviation occurs when A and B atoms prefer 
like atoms as neighbours and the free energy curve 
takes the form shown in Figure 3.10d. In diagrams 
3.10b and 3.10c the curvature dG 2 /dc 2 is everywhere 
positive whereas in 3.10d there are two minima and 
a region of negative curvature between points of 
inflexion 1 given by dG 2 /dc 2 = 0. A free energy curve 
for which d 2 G/dc 2 is positive, i.e. simple U-shaped, 
gives rise to a homogeneous solution. When a region 
of negative curvature exists, the stable state is a phase 
mixture rather than a homogeneous solid solution, 
as shown in Figure 3.11a. An alloy of composition 
c has a lower free energy G c when it exists as a 
mixture of A-rich phase (aq) of composition c A and B- 
rich phase (a 2 ) of composition c B in the proportions 
given by the Lever Rule, i.e. ot\/a 2 = (c B — c)/(c — 
c A ). Alloys with composition c < c A or c > c B exist 
as homogeneous solid solutions and are denoted by 
phases, oq and a 2 respectively. Partial miscibility in the 
solid state can also occur when the crystal structures 
of the component metals are different. The free energy 
curve then takes the form shown in Figure 3.11b, the 
phases being denoted by a and fi. 

3.2.4 Two-phase equilibria 

3.2.4. 1 Extended and limited solid solubility 
Solid solubility is a feature of many metallic and 
ceramic systems, being favoured when the components 
have similarities in crystal structure and atomic (ionic) 
diameter. Such solubility may be either extended (con- 
tinuous) or limited. The former case is illustrated by 
the binary phase diagram for the nickel- copper system 
(Figure 3.12) in which the solid solution ( a ) extends 

^he composition at which d 2 G/dc 2 = 0 varies with 
temperature and the corresponding 

temperature -composition curves are called spinodal lines. 



Figure 3.12 Binary phase diagram for Ni-Cu system 
showing extended solid solubility. 


from component to component. In contrast to the 
abrupt (congruent) melting points of the pure metals, 
the intervening alloys freeze, or fuse, over a range of 
temperatures which is associated with a univariant two- 
phase ( a + liquid) field. This ‘pasty’ zone is located 
between two lines known as the liquidus and solidus. 
(The phase diagrams for Ni-Cu and MgO-FeO sys- 
tems are similar in form.) 

Let us consider the very slow (equilibrating) solidi- 
fication of a 70Ni-30Cu alloy. A commercial version 
of this alloy, Monel , also contains small amounts of 
iron and manganese. It is strong, ductile and resists 
corrosion by all forms of water, including sea-water 
(e.g. chemical and food processing, water treatment). 
An ordinate is erected from its average composition on 
the base line. Freezing starts at a temperature T\. A 
horizontal tie-line is drawn to show that the first crys- 
tals of solid solution to form have a composition oq. 
When the temperature reaches T 2 , crystals of compo- 
sition ol 2 are in equilibrium with liquid of composition 
L 2 . Ultimately, at temperature T 3 , solidification is com- 
pleted as the composition of the crystals coincides 
with the average composition of the alloy. It will be 
seen that the compositions of the a-phase and liquid 
have moved down the solidus and liquidus, respec- 
tively, during freezing. 

Each tie-line joins two points which represent two 
phase compositions. One might visualize that a two- 
phase region in a binary diagram is made up of an infi- 
nite number of horizontal (isothermal) tie-lines. Using 
the average alloy composition as a fulcrum (x) and 
applying the Lever Rule, it is quickly possible to derive 
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mass ratios and fractions. For instance, for equilibrium 
at temperature T 2 in Figure 3.12, the mass ratio of 
solid solution crystals to liquid is xL^Iol^x. Similarly, 
the mass fraction of solid in the two-phase mixture at 
this temperature is xL 2 IL 2 ol 2 . Clearly, the phase com- 
positions are of greater structural significance than the 
average composition of the alloy. If the volumetric 
relations of two phases are required, these being what 
we naturally assess in microscopy, then the above val- 
ues must be corrected for phase density. 

In most systems, solid solubility is far more 
restricted and is often confined to the phase field 
adjacent to the end-component. A portion of a binary 
phase diagram for the copper- beryllium system, which 
contains a primary, or terminal, solid solution, is 
shown in Figure 3.13. Typically, the curving line 
known as the solvus shows an increase in the 
ability of the solvent copper to dissolve beryllium 
solute as the temperature is raised. If a typical 
‘beryllium -copper’ containing 2% beryllium is first 
held at a temperature just below the solidus (solution- 
treated), water-quenched to preserve the o'-phase and 
then aged at a temperature of 425 °C, particles of 
a second phase (y) will form within the o'-phase 
matrix because the alloy is equilibrating in the (a + y) 
field of the diagram. This type of treatment, closely 
controlled, is known as precipitation-hardening; the 
mechanism of this important strengthening process 
will be discussed in detail in Chapter 8. Precipitation- 
hardening of a typical beryllium -copper, which also 
contains up to 0.5% cobalt or nickel, can raise the 
0.1% proof stress to 1200 MN m -2 and the tensile 
strength to 1400 MN m -2 . Apart from being suitable 
for non-sparking tools, it is a valuable spring material, 
being principally used for electrically conductive brush 
springs and contact fingers in electrical switches. 
A curving solvus is an essential feature of phase 
diagrams for precipitation-hardenable alloys (e.g. 
aluminium- copper alloys (Duralumin)). 

When solid-state precipitation takes place, say 
of fi within a matrix of supersaturated a grains, 
this precipitation occurs in one or more of the 
following preferred locations: (1) at grain boundaries, 
(2) around dislocations and inclusions, and (3) on 
specific crystallographic planes. The choice of site 
for precipitation depends on several factors, of which 
grain size and rate of nucleation are particularly 
important. If the grain size is large, the amount of grain 
boundary surface is relatively small, and deposition 
of /3-phase within the grains is favoured. When this 
precipitation occurs preferentially on certain sets of 
crystallographic planes within the grains, the etched 
structure has a mesh-like appearance which is known 
as a Widmanstatten-type structure. 1 Widmanstatten 
structures have been observed in many alloys (e.g. 
overheated steels). 

1 Named after Count Alois von Widmanstatten who 
discovered this morphology within an iron-nickel meteorite 
sample in 1808. 
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Figure 3.13 Cu-rich end of phase diagram for Cu-Be 
system, showing field of primary solid solution (a). 


3.2A.2 Coring 

It is now possible to consider microsegregation, a phe- 
nomenon introduced in Section 3.1.4, in more detail. 
Referring again to the freezing process for a Ni-Cu 
alloy (Figure 3.12), it is clear that the composition of 
the o'-phase becomes progressively richer in copper 
and, consequently, if equilibrium is to be maintained 
in the alloy, the two phases must continuously adjust 
their compositions by atomic migration. In the liquid 
phase such diffusion is relatively rapid. Under indus- 
trial conditions, the cooling rate of the solid phase is 
often too rapid to allow complete elimination of dif- 
ferences in composition by diffusion. Each grain of 
the o'-phase will thus contain composition gradients 
between the core, which will be unduly rich in the 
metal of higher melting point, and the outer regions, 
which will be unduly rich in the metal of lower melting 
point. Such a non-uniform solid solution is said to be 
cored: etching of a polished specimen can reveal a pat- 
tern of dendritic segregation within each cored grain. 
The faster the rate of cooling, the more pronounced 
will be the degree of coring. Coring in chill-cast ingots 
is, therefore, quite extensive. 

The physical and chemical hetereogeneity produced 
by non-equilibrium cooling rates impairs properties. 
Cored structures can be homogenized by annealing. 
For instance, an ingot may be heated to a temperature 
just below the solidus temperature where diffusion 
is rapid. The temperature must be selected with 
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care because some regions might be rich enough in 
low melting point metal to cause localized fusion. 
However, when practicable, it is more effective 
to cold-work a cored structure before annealing. 
This treatment has three advantages. First, dendritic 
structures are broken up by deformation so that 
regions of different composition are intermingled, 
reducing the distances over which diffusion must 
take place. Second, defects introduced by deformation 
accelerate rates of diffusion during the subsequent 
anneal. Third, deformation promotes recrystallization 
during subsequent annealing, making it more likely 
that the cast structure will be completely replaced by 
a generation of new equiaxed grains. Hot- working is 
also capable of eliminating coring. 

3. 2.4. 3 Cellular microsegregation 
In the case of a solid solution, we have seen that it 
is possible for solvent atoms to tend to freeze before 
solute atoms, causing gradual solute enrichment of 
an alloy melt and, under non-equilibrium conditions, 
dendritic coring (e.g. Ni-Cu). When a very dilute 
alloy melt or impure metal freezes, it is possible for 
each crystal to develop a regular cell structure on a 
finer scale than coring. The thermal and compositional 
condition responsible for this cellular microsegregation 
is referred to as constitutional undercooling. 

Suppose that a melt containing a small amount 
of lower-m.p. solute is freezing. The liquid becomes 
increasingly enriched in rejected solute atoms, partic- 
ularly close to the moving solid/liquid interface. The 
variation of liquid composition with distance from 



Distance x 

Figure 3.14 Variation with distance from solid/liquid 
interface of (a) melt composition and (b) actual temperature 
T and freezing temperature T i. 


the interface is shown in Figure 3.14a. There is a 
corresponding variation with distance of the temper- 
ature T l at which the liquid will freeze, since solute 
atoms lower the freezing temperature. Consequently, 
for the positive gradient of melt temperature T shown 
in Figure 3.14b, there is a layer of liquid in which the 
actual temperature T is below the freezing temperature 
T l : this layer is constitutionally undercooled. Clearly, 
the depth of the undercooled zone, as measured from 
the point of intersection, will depend upon the slope 
of the curve for actual temperature, i.e. G L = d T /dx. 
As G l decreases, the degree of constitutional under- 
cooling will increase. 

Suppose that we visualize a tie-line through the two- 
phase region of the phase diagram fairly close to the 
component of higher m.p. Assuming equilibrium, a 
partition or distribution coefficient k can be defined 
as the ratio of solute concentration in the solid to that 
in the liquid, i.e. c s /c L . For an alloy of average com- 
position c 0 , the solute concentration in the first solid to 
freeze is kc 0 , where k < 1, and the liquid adjacent to 
the solid becomes richer in solute than c 0 . The next 
solid to freeze will have a higher concentration of 
solute. Eventually, for a constant rate of growth of 
the solid/liquid interface, a steady state is reached for 
which the solute concentration at the interface reaches 
a limiting value of co/k and decreases exponentially 
within the liquid to the bulk composition. This con- 
centration profile is shown in Figure 3.14a. 

The following relation can be derived by applying 
Fick’s second law of diffusion (Section 6.4.1): 


cl — cq 


1 + 


1 -k 
— exp 



(3.2) 


where x is the distance into the liquid ahead of the 
interface, c L is the solute concentration in the liquid 
at point x, R is the rate of growth, and D is the 
diffusion coefficient of the solute in the liquid. The 
temperature distribution in the liquid can be calculated 
if it is assumed that k is constant and that the liquidus 
is a straight line of slope m. For the two curves of 
Figure 3.14b: 

T = To - mco/k + G L x (3.3) 


and 


T L = To-mco 


1 + 


1 -k 
——exp 
k 



(3.4) 


where To is the freezing temperature of pure solvent, 
T l the liquidus temperature for the liquid of composi- 
tion c L and T is the actual temperature at any point x. 

The zone of constitutional undercooling may be 
eliminated by increasing the temperature gradient G L , 
such that: 


G l > dT L /dx 


(3.5) 


Substituting for T h and putting [1 — (Rx/D] for the 
exponential gives the critical condition: 


G l mc 0 /I - k\ 

~R > ~D V k J 


(3.6) 
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Figure 3.15 The breakdown of a planar solid-liquid interface (a), (b) leading to the formation of a cellular structure of the 
form shown in (c) for Sn/0.5 at.% Sb x 140. 


This equation summarizes the effect of growth condi- 
tions upon the transition from planar to cellular growth 
and identifies the factors that stabilize a planar inter- 
face. Thus, a high G L , low R and low c 0 will reduce 
the tendency for cellular (and dendritic) structures to 
form. 

The presence of a zone of undercooled liquid 
ahead of a macroscopically planar solid/liquid interface 
(Section 3.1.2) makes it unstable and an interface with 
cellular morphology develops. The interface grows 
locally into the liquid from a regular array of points on 
its surface, forming dome-shaped cells. Figures 3.15a 
and 3.15b show the development of domes within a 
metallic melt. As each cell grows by rapid freezing, 
solute atoms are rejected into the liquid around its base 
which thus remains unfrozen. This solute-rich liquid 
between the cells eventually freezes at a much lower 
temperature and a crystal with a periodic columnar 
cell structure is produced. Solute or impurity atoms 
are concentrated in the cell walls. Decantation of 
a partly- solidified melt will reveal the characteristic 
surface structure shown in Figure 3.15c. The cells 
of metals are usually hexagonal in cross-section and 
about 0.05-1 mm across: for each grain, their major 
axes have the same crystallographic orientation to 
within a few minutes of arc. It is often found that 
a lineage or macromosaic structure (Section 3.1.1) is 
superimposed on the cellular structure; this other form 
of sub-structure is coarser in scale. 

Different morphologies of a constitutionally- 
cooled surface, other than cellular, are possible. 
A typical overall sequence of observed growth 
forms is planar/cellular/cellular dendritic/dendritic. 
Substructures produced by constitutional undercooling 
have been observed in ‘doped’ single crystals and 
in ferrous and non-ferrous castings/weldments. 1 When 

! The geological equivalent, formed by very slowly cooling 
magma, is the hexagonal-columnar structure of the Giant’s 
Causeway, Northern Ireland. 


the extent of undercooling into the liquid is increased 
as, for example, by reducing the temperature gradient 
G l , the cellular structure becomes unstable and a few 
cells grow rapidly as cellular dendrites. The branches 
of the dendrites are interconnected and are an extreme 
development of the dome- shaped bulges of the cell 
structure in directions of rapid growth. The growth of 
dendrites in a very dilute, constitutionally-undercooled 
alloy is slower than in a pure metal because solute 
atoms must diffuse away from dendrite/liquid surfaces 
and also because their growth is limited to the 
undercooled zone. Cellular impurity-generated sub- 
structures have also been observed in ‘non-metals’ as a 
result of constitutional undercooling. Unlike the dome- 
shaped cells produced with metals, non-metals produce 
faceted projections which relate to crystallographic 
planes. For instance, cells produced in a germanium 
crystal containing gallium have been reported in which 
cell cross-sections are square and the projection tips 
are pyramid- shaped, comprising four octahedral {111} 
planes. 

3. 2. 4. 4 Zone-refining 

Extreme purification of a metal can radically improve 
properties such as ductility, strength and corrosion- 
resistance. Zone-refining was devised by W. G. Pfann, 
its development being ‘driven’ by the demands of the 
newly invented transistor for homogeneous and ultra- 
pure metals (e.g. Si, Ge). The method takes advantage 
of non-equilibrium effects associated with the ‘pasty’ 
zone separating the liquidus and solidus of impure 
metal. Considering the portion of Figure 3.12 where 
addition of solute lowers the liquidus temperature, 
the concentration of solute in the liquid, c L , will 
always be greater than its concentration c s in the 
solid phase; that is, the distribution coefficient k = 
c s /c L is less than unity. If a bar of impure metal 
is threaded through a heating coil and the coil is 
slowly moved, a narrow zone of melt can be made 
to progress along the bar. The first solid to freeze is 
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purer than the average composition by a factor of k , 
while that which freezes last, at the trailing interface, 
is correspondingly enriched in solute. A net movement 
of impurity atoms to one end of the bar takes place. 
Repeated traversing of the bar with a set of coils 
can reduce the impurity content well below the limit 
of detection (e.g. <1 part in 10 10 for germanium). 
Crystal defects are also eliminated: Pfann reduced 
the dislocation density in metallic and semi-metallic 
crystals from about 3.5 x 10 6 cm -2 to almost zero. 
Zone -refining has been used to optimize the ductility of 
copper, making it possible to cold-draw the fine-gauge 
threads needed for interconnects in very large-scale 
integrated circuits. 

3.2.5 Three-phase equilibria and reactions 

3.2.5. 1 The eutectic reaction 
In many metallic and ceramic binary systems it is 
possible for two crystalline phases and a liquid to co- 
exist. The modified Phase Rule reveals that this unique 
condition is invariant; that is, the temperature and 
all phase compositions have fixed values. Figure 3.16 
shows the phase diagram for the lead- tin system. 
It will be seen that solid solubility is limited for 
each of the two component metals, with a and /3 
representing primary solid solutions of different crystal 
structure. A straight line, the eutectic horizontal, passes 
through three phase compositions (a e , L e and /? e ) at the 
temperature T e . As will become clear when ternary 
systems are discussed (Section 3.2.9), this line is a 
collapsed three-phase triangle: at any point on this 
line, three phases are in equilibrium. During slow 
cooling or heating, when the average composition of 
an alloy lies between its limits, a e and /3 e , a eutectic 
reaction takes place in accordance with the equation 
L e ^ a e + p e . The sharply-defined minimum in the 
liquidus, the eutectic (easy-melting) point, is a typical 
feature of the reaction. 

Consider the freezing of a melt, average composition 
37Pb-63Sn. At the temperature T e of approximately 
180°C, it freezes abruptly to form a mechanical mix- 
ture of two solid phases, i.e. Liquid L e a e + 
From the Lever Rule, the a/f$ mass ratio is approx- 
imately 9:11. As the temperature falls further, slow 
cooling will allow the compositions of the two phases 
to follow their respective solvus lines. Tie-lines across 
this (a + p) field will provide the mass ratio for any 
temperature. In contrast, a hypoeutectic alloy melt, say 
of composition 70Pb-30Sn, will form primary crystals 
of a over a range of temperature until T t is reached. 
Successive tie-lines across the ( a + Liquid) field show 
that the crystals and the liquid become enriched in tin 
as the temperature falls. When the liquid composition 
reaches the eutectic value L e , all of the remaining liq- 
uid transforms into a two-phase mixture, as before. 
However, for this alloy, the final structure will com- 
prise primary grains of a? in a eutectic matrix of a and 
ft. Similarly, one may deduce that the structure of a 
solidified hyper-eutectic alloy containing 30Pb-70Sn 


will consist of a few primary /3 grains in a eutectic 
matrix of a and p. 

Low-lead or low-tin alloys, with average composi- 
tions beyond the two ends of the eutectic horizontal, 1 
freeze by transforming completely over a small range 
of temperature into a primary phase. (Changes in com- 
position are similar in character to those described 
for Figure 3.12) When the temperature ‘crosses’ the 
relevant solvus, this primary phase becomes unsta- 
ble and a small amount of second phase precipitates. 
Final proportions of the two phases can be obtained 
by superimposing a tie-line on the central two-phase 
field: there will be no signs of a eutectic mixture in 
the micro structure. 

The eutectic (37Pb-63Sn) and hypo-eutectic 
(70Pb-30Sn) alloys chosen for the description of 
freezing represent two of the numerous types of solder 2 
used for joining metals. Eutectic solders containing 
60-65% tin are widely used in the electronics industry 
for making precise, high-integrity joints on a mass- 
production scale without the risk of damaging heat- 
sensitive components. These solders have excellent 
‘wetting’ properties (contact angle <10°), a low 
liquidus and a negligible freezing range. The long 
freezing range of the 70Pb-30Sn alloy (plumbers’ 
solder) enables the solder at a joint to be ‘wiped’ while 
‘pasty’. 

The shear strength of the most widely-used solders is 
relatively low, say 25-55 MN m -2 , and mechanically- 
interlocking joints are often used. Fluxes (corrosive 
zinc chloride, non-corrosive organic resins) facilitate 
essential ‘wetting’ of the metal to be joined by dis- 
solving thin oxide films and preventing re-oxidation. 
In electronic applications, minute solder preforms have 
been used to solve the problems of excess solder and 
flux. 

Figure 3.16 shows the sequence of structures 
obtained across the breadth of the Pb-Sn system. 
Cooling curves for typical hypo-eutectic and eutectic 
alloys are shown schematically in Figure 3.17a. 
Separation of primary crystals produces a change in 
slope while heat is being evolved. Much more heat 
is evolved when the eutectic reaction takes place. 
The lengths (duration) of the plateaux are proportional 
to the amounts of eutectic structure formed, as 
summarized in Figure 3.17b. Although it follows that 
cooling curves can be used to determine the form of 
such a simple system, it is usual to confirm details by 
means of microscopical examination (optical, scanning 
electron) and X-ray diffraction analysis. 

theoretically, the eutectic horizontal cannot cut the vertical 
line representing a pure component: some degree of solid 
solubility, however small, always occurs. 

2 Soft solders for engineering purposes range in composition 
from 20% to 65% tin; the first standard specifications for 
solders were produced in 1918 by the ASTM. The USA is 
currently contemplating the banning of lead-bearing 
products; lead-free solders are being sought. 
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Figure 3.16 Phase diagram for Ph-Sn system. Alloy 1: 63Sn-37Pb, Alloy 2: 70Pb-30Sn, Alloy 3: 70Sn-30Pb. 



Figure 3.17 (a) Typical cooling curves for hypo-eutectic alloy 2 and eutectic alloy 1 in Figure 3.16 and (b) dependence of 
duration of cooling arrest at eutectic temperature T f on composition. 


3. 2. 5. 2 The peritectic reaction 
Whereas eutectic systems often occur when the melt- 
ing points of the two components are fairly similar, the 
second important type of invariant three-phase condi- 
tion, the peritectic reaction, is often found when the 
components have a large difference in melting points. 


Usually they occur in the more complicated systems; 
for instance, there is a cascade of five peritectic reac- 
tions in the Cu-Zn system (Figure 3.20). 

A simple form of peritectic system is shown in, 
Figure 3.18a; although relatively rare in practice (e.g. 
Ag-Pt), it can serve to illustrate the basic principles. 
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Figure 3.18 (a) Simple peritectic system; (b) development of a peritectic ‘ wall \ 


A horizontal line, the key to the reaction, links 
three critical phase compositions; that is, a p , p v and 
liquid Lp. A peritectic reaction occurs if the average 
composition of the alloy crosses this line during either 
slow heating or cooling. It can be represented by the 
equation a p + L p ^ p p . Binary alloys containing less 
of component B than the point a p will behave in the 
manner previously described for solid solutions. A 
melt of alloy 1, which is of peritectic composition, will 
freeze over a range of temperature, depositing crystals 
of primary a-phase. The melt composition will move 
down the liquidus, becoming richer in component B. 
At the peritectic temperature T p , liquid of composition 
L p will react with these primary crystals, transforming 
them completely into a new phase, p, of different 
crystal structure in accordance with the equation a p + 
T p -> p p . In the system shown, p remains stable during 
further cooling. Alloy 2 will aso deposit primary a , 
but the reaction at temperature L p will not consume 
all these crystals and the final solid will consist of p 
formed by peritectic reaction and residual a. Initially, 
the alp mass ratio will be approximately 2.5 to 1 
but both phases will adjust their compositions during 
subsequent cooling. In the case of alloy 3, fewer 
primary crystals of a form: later, they are completely 
destroyed by the peritectic reaction. The amount of p 
in the resultant mixture of p and liquid increases until 
the liquid disappears and an entire structure of p is 
produced. 

The above descriptions assume that equilibrium is 
attained at each stage of cooling. Although very slow 
cooling is unlikely in practice, the nature of the peri- 
tectic reaction introduces a further complication. The 
reaction product p tends to form a shell around the 
particles of primary a: its presence obviously inhibits 
the exchange of atoms by diffusion which equilibrium 
demands (Figure 3.18b). 


3.2. 5. 3 Classification of three-phase equilibria 
The principal invariant equilibria involving three con- 
densed (solid, liquid) phases can be conveniently 
divided into eutectic- and peritectic-types and classi- 
fied in the manner shown in Table 3.1. Interpretation of 
these reactions follows the methodology already set out 
for the more common eutectic and peritectic reactions. 

The inverse relation between eutectic- and peritectic - 
type reactions is apparent from the line diagrams. 
Eutectoid and peritectoid reactions occur wholly in the 
solid state. (The eutectoid reaction y ^ a + Fe 3 C is 
the basis of the heat-treatment of steels.) In all the 
systems so far described, the components have been 
completely miscible in the liquid state. In monotectic 
and syntectic systems, the liquid phase field contains 
a region in which two different liquids (L\ and L 2 ) are 
immiscible. 

3.2.6 Intermediate phases 

An intermediate phase differs in crystal structure from 
the primary phases and lies between them in a phase 
diagram. In Figure 3.19, which shows the diagram 
for the Mg -Si system, Mg 2 Si is the intermediate 
phase. Sometimes intermediate phases have definite 
stoichiometric ratios of constituent atoms and appear 
as a single vertical line in the diagram. However, 
they frequently exist over a range of composition and 
it is therefore generally advisable to avoid the term 
‘compound’ . 

In some diagrams, such as Figure 3.19, they extend 
from room temperature to the liquidus and melt or 
freeze without any change in composition. Such a 
melting point is said to be congruent: the melting 
point of a eutectic alloy is incongruent. A congruently 
melting phase provides a convenient means to divide 
a complex phase diagram (binary or ternary) into 
more readily understandable parts. For instance, an 
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Table 3.1 Classification of three-phase equilibria 


Eutectic-type Eutectic Liq a + f 

reactions 



Al-Si, Pb-Sn, Cu-Ag 
AI2O3— Si02, AI2O3— Zr02 


Eutectoid y ^ a + ft 


Monotectic Liq! # a + Liq 2 


Peritectic-type Peritectic a + Liq ^ f 

reactions 


Peritectoid a + f> ^ y 

Syntectic Liq! + Liq 2 ^ a 



Fe-C, Cu-Zn 

Cu-Pb, Ag-Ni 
Si02-Ca0 

Cu-Zn, Ag-Pt 

Ag-Al 

Na-Zn 


at % Si ► 

0 20 40 60 80 1 00 



Figure 3.19 Phase diagram for Mg -Si system showing intermediate phase Mg2Si (after Brandes and Brook, 1992 ). 


ordinate through the vertex of the intermediate phase in 
Figure 3.19 produces two simple eutectic sub-systems. 
Similarly, an ordinate can be erected to pass through 
the minimum (or maximum) of the liquidus of a solid 
solution (Figure 3.38b). 

In general, intermediate phases are hard and brittle, 
having a complex crystal structure (e.g. Fe 3 C, CuAl 2 
( 1 0 )). For instance, it is advisable to restrict time and 


temperature when soldering copper alloys, otherwise it 
is possible for undesirable brittle layers of Cu 3 Sn and 
Cu 6 Sn 5 to form at the interface. 

3.2.7 Limitations of phase diagrams 

Phase diagrams are extremely useful in the interpre- 
tation of metallic and ceramic structures but they are 
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subject to several restriction. Primarily, they identify 
which phases are likely to be present and provide com- 
positional data. The most serious limitation is that they 
give no information on the structural form and distribu- 
tion of phases (e.g. lamellae, spheroids, intergranular 
films, etc.). This is unfortunate, since these two fea- 
tures, which depend upon the surface energy effects 
between different phases and strain energy effects due 
to volume and shape changes during transformations, 
play an important role in the mechanical behaviour 
of materials. This is understood if we consider a two- 
phase (a + /3) material containing only a small amount 
of /3-phase. The /3-phase may be dispersed evenly as 
particles throughout the a-grains, in which case the 
mechanical properties of the material would be largely 
governed by those of the a-phase. However, if the f>- 
phase is concentrated at grain boundary surfaces of the 
a-phase, then the mechanical behaviour of the mate- 
rial will be largely dictated by the properties of the 
/3-phase. For instance, small amounts of sulphide par- 
ticles, such as grey manganese sulphide (MnS), are 
usually tolerable in steels but sulphide films at the 
grain boundaries cause unacceptable embrittlement. 

A second limitation is that phase diagrams por- 
tray only equilibrium states. As indicated in previous 
sections, alloys are rarely cooled or heated at very 
slow rates. For instance, quenching, as practised in the 
heat-treatment of steels, can produce metastable phases 
known as martensite and bainite that will then remain 
unchanged at room temperature. Neither appears in 
phase diagrams. In such cases it is necessary to devise 
methods for expressing the rate at which equilibrium 
is approached and its temperature-dependency. 

3.2.8 Some key phase diagrams 

3.2.8. 1 Copper-zinc system 
Phase diagrams for most systems, metallic and 
ceramic, are usually more complex than the examples 
discussed so far. Figure 3.20 for the Cu-Zn system 
illustrates this point. The structural characteristics and 
mechanical behaviour of the industrial alloys known 
as brasses can be understood in terms of the copper- 
rich end of this diagram. Copper can dissolve up to 
40% w/w of zinc and cooling of any alloy in this range 
will produce an extensive primary solid solution (fcc- 
a). By contrast, the other primary solid solution ( 37 ) is 
extremely limited. A special feature of the diagram is 
the presence of four intermediate phases (/3, y, 5, s). 
Each is formed during freezing by peritectic reaction 
and each exists over a range of composition. Another 
notable feature is the order- disorder transformation 
which occurs in alloys containing about 50% zinc 
over the temperature range 450-470°C. Above this 
temperature range, bcc /3-phase exists as a disordered 
solid solution. At lower temperatures, the zinc atoms 
are distributed regularly on the bcc lattice: this ordered 
phase is denoted by ft . 

Suppose that two thin plates of copper and zinc are 
held in very close contact and heated at a temperature 



Figure 3.20 Phase diagram for copper-zinc (from Raynor; 
courtesy of the Institute of Metals). 

of 400° C for several days. Transverse sectioning of the 
diffusion couple will reveal five phases in the sequence 
al ply lelr), separated from each other by a planar inter- 
face. The 5-phase will be absent because it is unstable 
at temperatures below its eutectoid horizontal (560° C). 
Continuation of diffusion will eventually produce one 
or two phases, depending on the original proportions 
of copper and zinc. 

3.2. 8.2 Iron -carbon system 

The diagram for the part of the Fe-C system shown 
in Figure 3.21 is the basis for understanding the 
microstructures of the ferrous alloys known as steels 
and cast irons. Dissolved carbon clearly has a pro- 
nounced effect upon the liquidus, explaining why the 
difficulty of achieving furnace temperatures of 1600°C 
caused large-scale production of cast irons to predate 
that of steel. The three allotropes of pure iron are a-Fe 
(bcc), y-Fe (fee) and 5-Fe (bcc ). 1 Small atoms of car- 
bon dissolve interstitially in these allotropes to form 
three primary solid solutions: respectively, they are a- 
phase (ferrite), y-phase (austenite) and 5-phase. At the 
other end of the diagram is the orthorhombic interme- 
diate phase FesC, which is known as cementite. 

The large difference in solid solubility of carbon 
in austenite and ferrite, together with the existence of 
a eutectoid reaction, are responsible for the versatile 
behaviour of steels during heat-treatment. Aei, Ae 2 , 
Ae 3 and A cm indicate the temperatures at which phase 
changes occur: they are arrest points for equilibria 
detected during thermal analysis. For instance, slow 
cooling enables austenite (0.8% C) to decompose 
eutectoidally at the temperature Aei and form the 
microconstituent pearlite, a lamellar composite of soft, 

! The sequence omits /3-Fe, a term once used to denote a 
non-magnetic form of a-Fe which exists above the Curie 
point. 
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Figure 3.21 Phase diagram for Fe-C system ( dotted lines represent iron-graphite equilibrium ). 


ductile ferrite (initially 0.025% C) and hard, brittle 
cementite (6.67% C). Quenching of austenite from a 
temperature above Ae 3 forms a hard metastable phase 
known as martensite. From the diagram one can see 
why a medium-carbon (0.4%) steel must be quenched 
from a higher Ae 3 temperature than a high-carbon 
(0.8%) steel. Temperature and composition ‘windows’ 
for some important heat-treatment operations have 
been superimposed upon the phase diagram. 

3. 2. 8. 3 Copper-lead system 
The phase diagram for the Cu-Pb system (Figure 3.22) 
provides an interesting example of extremely limited 
solubility in the solid state and partial immiscibility 
in the liquid state. The two components differ greatly 
in density and melting point. Solid solutions, a and 
p, exist at the ends of the diagram. The ‘miscibility 
gap’ in the liquid phase takes the form of a dome- 
shaped two-phase (Li+L 2 ) field. At temperatures 
above the top of the dome, the critical point, 


liquid miscibility is complete. The upper isothermal 
represents a monotectic reaction, i.e. L\ ^ a + L 2 . 

On cooling, a hyper-monotectic 50Cu-50Pb melt 
will separate into two liquids of different composition. 
The degree of separation depends on cooling condi- 
tions. Like oil and water, the two liquids may form an 
emulsion of droplets or separate into layers according 
to density. At a temperature of 954° C, the copper-rich 
liquid Li disappears, forming a crystals and more of 
the lead-rich liquid L 2 . This liquid phase gets richer 
in lead and eventually decomposes by eutectic reac- 
tion, i.e. L 2 ^ a + p. (Tie-lines can be used for all 
two-phase fields, of course; however, because of den- 
sity differences, mass ratios may differ greatly from 
observed volume ratios.) 

The hypo-monotectic 70Cu-30Pb alloy, rapidly 
cast, has been used for steel-backed bearings: dispersed 
friction-reducing particles of lead-rich p are supported 
in a supporting matrix of copper-rich a. Binary 
combinations of conductive metal (Cu, Ag) and 



62 Modern Physical Metallurgy and Materials Engineering 



refractory arc -resistant metal (W, Mo, Ni) have 
been used for electrical contacts (e.g. 60Ag-40Ni). 
These particular monotectic systems, with their liquid 
immiscibility, are difficult to cast and are therefore 
made by powder metallurgy techniques. 

3. 2. 8. 4 Alumina- silica system 
The binary phase diagram for alumina- silica 
(Figure 3.23) is of special relevance to the refractories 
industry, an industry which produces the bricks, 
slabs, shapes, etc. for the high-temperature plant that 


make steel-making, glass-making, heat-treatment, etc. 
possible. The profile of its liquidus shows a minimum 
and thus mirrors the refractoriness of aluminosilicate 
refractories (Figure 3.24). Refractoriness, the prime 
requirement of a refractory, is commonly determined 
by an empirical laboratory test. A sample cone of a 
given refractory is placed on a plaque and located at 
the centre of a ring of standard cones, each of which 
has a different softening or slumping temperature and 
is identified by a Pyrometric Cone Equivalent (PCE) 
number. All cones are then slowly heated until the 
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Figure 3.23 Phase diagram for SiO 2 -AI 2 O 3 system. 



sample cone bends or slumps under gravity: the PCE 
of a standard cone that has behaved similarly is noted 
and taken to represent the refractoriness of the sample. 
It will be realized that the end-point of the PCE test 
is rather arbitrary, being a rising-temperature value. 
(Other requirements may include refractoriness-under- 
load, resistance to thermal shock, resistance to attack 
by molten slag, low thermal conductivity, etc.) 

The steeply-descending liquidus shows the adverse 
effect of a few per cent of alumina on the refractori- 
ness of silica bricks. (Sodium oxide, Na 2 0, has an 
even more pronounced eutectic-forming effect and is 
commonly used to flux sand particles during glass- 
melting.) The discovery of this eutectic point led to 
immediate efforts to keep the alumina content as far 
below 5% as possible. Silica refractories are made by 
firing size-graded quartzite grains and a small amount 


of lime (CaO) flux at a temperature of 1450°C: the 
final structure consists of tridymite, cristobalite and a 
minimal amount of unconverted quartz. Tridymite is 
preferred to cristobalite because of the large volume 
change (~1%) associated with the a/p cristobalite 
inversion. The lime forms an intergranular bond of 
Si0 2 -CaO glass. Chequerwork assemblies of silica 
bricks are used in hot-blast stoves that regeneratively 
preheat combustion air for iron-making blast furnaces 
to temperatures of 1200-1300°C. Silica bricks have 
a surprisingly good refractoriness-under-load at tem- 
peratures only 50°C or so below the melting point 
of pure silica 1723°C). Apparently, the fired grains 
of tridymite and cristobalite interlock, being able to 
withstand a compressive stress of, say, 0.35 MN m -2 
at these high temperature levels. 

Firebricks made from carefully- selected low-iron 
clays are traditionally used for furnace-building. These 
clays consist essentially of minute platey crystals 
of kaolinite, Al 2 (Si 2 0 5 )(0H) 4 : the (OH) groups are 
expelled during firing. The alumina content (46%) 
of fired kaolinite sets the upper limit of the normal 
composition range for firebricks. Refractoriness rises 
steeply with alumina content and aluminous fireclays 
containing 40% or more of alumina are therefore par- 
ticularly valued. A fireclay suitable for refractories 
should have a PCE of at least 30 (equivalent to 
1670°C): with aluminous clays the PCE can rise 
to 35 (1770°C). Firing the clay at temperatures of 
1200-1400°C forms a glassy bond and an interlock- 
ing mass of very small lath-like crystals of mullite; 
this is the intermediate phase with a narrow range of 
composition which marks the edge of the important 
(mullite + corundum) plateau. High- alumina bricks, 
with their better refractoriness, have tended to replace 
firebricks. An appropriate raw material is obtained 
by taking clay and adding alumina (bauxite, artifi- 
cial corundum) or a ‘sillimanite-type’ mineral, Al 2 Si0 5 
(andalusite, sillimanite, kyanite). 

Phase transformations in ceramic systems are 
generally more sluggish than in metallic systems and 
steep concentration gradients can be present on a 
micro-scale. Thus tie-lines across the silica- mullite 
field usually only give approximate proportions of 
these two phases. The presence of traces of catalysing 
mineralizers, such as lime, can make application of the 
diagram nominal rather than rigorous. For instance, 
although silica bricks are fired at a temperature of 
1450°C, which is within the stability range of tridymite 
(870-1470°C), cristobalite is able to form in quantity. 
However, during service, true stability is approached 
and a silica brick operating in a temperature gradient 
will develop clearly-defined and separate zones of 
tridymite and cristobalite. 

By tradition, refractories are often said to be acid 
or basic, indicating their suitability for operation in 
contact with acid (Si0 2 -rich) or basic (CaO- or FeO- 
rich) slags. For instance, suppose that conditions are 
reducing and the lower oxide of iron, FeO, forms 
in a basic steel-making slag (1600°C). ‘Acid’ silica 
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refractory will be rapidly destroyed because this fer- 
rous oxide reacts with silica to form fayalite, Fe 2 Si0 4 , 
which has a melting point of 1180°C. (The Si0 2 -FeO 
phase diagram shows a sudden fall in the liquidus.) 
However, in certain cases, this approach is scientifi- 
cally inadequate. For instance, ‘acid’ silica also has a 
surprising tolerance for basic CaO-rich slags. Refer- 
ence to the Si0 2 -CaO diagram reveals that there is a 
monotectic plateau at its silica-rich end, a feature that 
is preferable to a steeply descending liquidus. Its exis- 
tence accounts for the slower rate of attack by molten 
basic slag and also, incidentally, for the feasibility of 
using lime as a bonding agent for silica grains during 
firing. 

3.2. 8. 5 Nickel-sulphur-oxygen and 
chromium- sulphur-oxygen systems 
The hot corrosion of superalloys based upon nickel, 
iron or cobalt by flue or exhaust gases from the 
combustion of sulphur-containing fuels is a problem 
common to a number of industries (e.g. power 
generation). These gases contain nitrogen, oxygen 
(excess to stoichiometric combustion requirements), 
carbon dioxide, water vapour, sulphur dioxide, sulphur 
trioxide, etc. In the case of a nickel-based alloy, the 
principal corrosive agents are sulphur and oxygen. 
They form nickel oxide and/or sulphide phases at the 
flue gas/alloy interface: their presence represents metal 
wastage. A phase diagram for the Ni-S-0 system, 
which makes due allowance for the pressure variables, 
provides a valuable insight into the thermochemistry of 
attack of a Ni-based superalloy. Although disregarding 
kinetic factors, such as diffusion, a stability diagram 
of this type greatly helps understanding of underlying 
mechanisms. Primarily, it indicates which phases are 
likely to form. Application of these diagrams to hot 
corrosion phenomena is discussed in Chapter 12. 

Under equilibrium conditions, the variables gov- 
erning chemical reaction at a nickel/gas interface are 
temperature and the partial pressures p 0l and p S2 for 
the gas phase. For isothermal conditions, the gen- 
eral disposition of phases will be as shown schemat- 
ically in Figure 3.25a. An isothermal section (900 K) 
is depicted in Figure 3.25b. A comprehensive three- 
dimensional representation, based upon standard free 
energy data for the various competing reactions, 
is given in Figure 3.26. Section AA is isothermal 
(1200 K): the full diagram may be regarded as a par- 
allel stacking of an infinite number of such vertical 
sections. From the Phase Rule, P + F = C + 2, it fol- 
lows that F = 5 — P. Hence, for equilibrium between 
gas and one condensed phase, there are three degrees 
of freedom and equilibrium is represented by a vol- 
ume. Similarly, equilibrium between gas and three 
condensed phases is represented by a line. The bivari- 
ant and univariant equilibrium equations which form 
the basis of the three-dimensional stability diagram are 
given in Figure 3.26. 


Gaseous 

Phase 

'/SondensedP/ 
V/ Phase // 
Heterogeneous 
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(a) 




(b) log Pcyatmos. — 

Figure 3.25 (a) General disposition of phases in 
Ni-S-0 system; (b) isothermal section at temperature of 
900 K (after Quets and Dresher, 1969, pp. 583-599). 


3.2.9 Ternary phase diagrams 

3.2.9. 1 The ternary prism 

Phase diagrams for three-component systems usually 
take the standard form of a prism which combines an 
equilateral triangular base (ABC) with three binary 
system ‘walls’ (A-B, B-C, C-A), as shown in 
Figure 3.27a. This three-dimensional form allows the 
three independent variables to be specified, i.e. two 
component concentrations and temperature. As the 
diagram is isobaric, the modified Phase Rule applies. 
The vertical edges can represent pure components of 
either metallic or ceramic systems. Isothermal contour 
lines are helpful means for indicating the curvature of 
liquidus and solidus surfaces. 

Figure 3.27b shows some of the ways in which the 
base of the prism, the Gibbs triangle, is used. For 
instance, the recommended method for deriving the 
composition of a point P representing a ternary alloy 
is to draw two construction lines to cut the nearest 
of the three binary composition scales. In similar 
fashion, the composition of the phases at each end 
of the tie-line passing through P can be derived. Tie- 
triangles representing three-phase equilibria commonly 
appear in horizontal (isothermal) sections through the 
prism. The example in Figure 3.27b shows equilibrium 
between a, f$ and y phases for an alloy of average 
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Figure 3.28 Derivation of vertical and horizontal sections: (a) lines of construction, (b) vertical section (~20% C), 
(c) isothermal section at temperature T 2 . 


composition Q. Each side of the tie-triangle is a 
tie-line. Applying the centre-of-gravity principle, the 
weight proportion of any phase in the mixture can 
be obtained by drawing a line and measuring two 
lengths (e.g. % ot = QR/Ra x 100). Congruently- 
melting phases can be linked together by a join- 
line, thus simplifying interpretation. For example, 
component B may be linked to the intermediate 
phase A 2 C. Any line originating at a pure component 
also provides a constant ratio between the other two 
components along its length. 

Tie-lines can be drawn across two-phase regions of 
isothermal sections but, unlike equivalent tie-lines in 
binary systems, they do not necessarily point directly 
towards pure components: their exact disposition has to 
be determined by practical experiment. It follows that 
when vertical sections (isopleths) are taken through the 
prism, insertion of tie-lines across a two-phase field is 
not always possible because they may be inclined to 
the vertical plane. For related reasons, one finds that 
three-phase regions in vertical sections have slightly 
curved sides. 

3. 2. 9. 2 Complete solid miscibility 
In the ternary diagram of Figure 3.27a the volume 
of continuous solid solubility ( a ) is separated from 
the liquid phase by a two-phase ( a + Fiquid) zone of 
convex-lens shape. Analysis of this diagram provides 
an insight into the compositional changes attending 
the freezing/melting of a ternary solid solution. It is 
convenient to fold down the binary ‘walls’ so that 


construction lines for isothermal and vertical sections 
can be prepared (Figure 3.28a). A vertical section, 
which provides guidance on the freezing and melting 
of all alloys containing the same amount of component 
C, is shown in Figure 3.28b. 

Assuming that an alloy of average composition 
X freezes over a temperature range of T \ to r 4 , 
Figure 3.28c shows an isothermal section for a 
temperature T 2 just below the liquidus. The two-phase 
field in this section is bivariant. As the proportions 
and compositions of a and liquid L gradually change, 
the tie-lines for the four temperatures change their 
orientation. This rotational effect, which is shown 
by projecting the tie-lines downward onto the basal 
triangle (Figure 3.29), occurs because the composition 
of the residual liquid L moves in the general 
direction of C, the component of lowest melting point, 
and the composition of a approaches the average 
composition X. 

3.2. 9. 3 Three-phase equilibria 
In a ternary system, a eutectic reaction such as 
Fiquid ^ a + is univariant (F = 1 ) and, unlike its 
binary equivalent, takes place over a range of tempera- 
ture. Its characteristics can be demonstrated by consid- 
ering the system shown in Figure 3.30a in which two 
binary eutectic reactions occur at different tempera- 
tures. The key feature of the diagram, a three-phase tri- 
angle (a + f + Fiquid), evolves from the upper binary 
eutectic horizontal and then appears to move down 
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Figure 3.29 Rotation of tie-lines during freezing of a 
ternary solid solution (a). 

three ‘guide rails’ until it finally degenerates into the 
lower binary eutectic horizontal. As the temperature 


falls, the eutectic reaction Liquid -> a + ft takes place; 
hence the leading vertex of the triangle represents the 
composition of the liquid phase. 

A vertical section between component A and the 
mid-point of the B-C ‘wall’ (Figure 3.30b) and 
an isothermal section taken when the tie-triangle 
is about halfway through its downward movement 
(Figure 3.30c) help to explain typical solidification 
sequences. The vertical section shows that when an 
alloy of composition X freezes, primary ft formation 
is followed by eutectic reaction over a range of 
temperature. The final microstructure consists of 
primary ft in a eutectic matrix of a and ft. If we 
now concentrate on the immediate surroundings of 
the isothermal triangle, as depicted in Figure 3.30d, 
primary deposition of p is represented by the tie-line 
linking ft and liquid compositions. (This type of tie-line 
rotates, as described previously.) The eutectic reaction 
(Liquid -> a + ft) starts when one leading edge of the 
triangle, in this case the tie-line L-fi, cuts point X and 
is completed when its trailing edge, the tie-line ot-f , 
cuts X. As X is traversed by tie-triangles, the relative 
amounts of the three phases can be derived for each 
isotherm. For alloy Y, lying on the valley line of the 




(c) C 



Figure 3.30 Three-phase equilibrium in a simple ternary system (after Rhine s, 1956). 
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eutectic reaction, it can be seen from both sections 
that no primary p forms and freezing only produces a 
mixture of a and p phases. 

These ideas can also be applied to three-phase peri- 
tectic reactions in ternary systems, i.e. a + Liquid ^ 
p. Being the converse form of the eutectic reaction, its 
tie-triangle has the (L + a) tie-line as the leading side 
and a trailing vertex. The ‘inflated’ triangle in vertical 
sections is inverted. 

3.2. 9.4 Four-phase equilibria 
The simplest form of four-phase equilibrium, referred 
to as Class I, is summarized by the ternary eutectic 
reaction, Liquid ^ a + p + y. This invariant condi- 
tion ( F = 0) is represented by the triangular plateau 
at the heart of Figure 3.31. The plateau itself is solid 
and can be regarded as a stack of three-phase triangles, 
i.e. a + p + y. Each of the three constituent binary 
systems is eutectiferous with limited solid solubility. 
It is necessary to visualize that, on cooling, each of 
the three eutectic horizontals becomes the independent 
source of a set of descending three-phase eutectic trian- 
gles which behave in the general manner described in 
the previous section. At the ternary eutectic level of the 
plateau, temperature T 6 , these three triangles coalesce 
and the ternary reaction follows. (The lines where tri- 
angles meet are needed when sections are drawn.) The 
stack of tie-triangles associated with each binary eutec- 
tic reaction defines a beak-shaped volume, the upper 
edge being a ‘valley’ line. Figure 3.31 shows three 
valley lines descending to the ternary eutectic point. 

A typical sequence of isothermal sections is shown 
in Figure 3.32 over the temperature range Ti~T 6 . As 
the temperature falls, the three eutectic tie-triangles 
appear in succession. The liquid field shrinks until, at 



Figure 3.31 Phase diagram for a ternary eutectic system 
(Class I four-phase equilibrium ) (after Rhines, 1956). 


temperature T 6 , the three triangles coalesce to form the 
larger triangle of four-phase equilibrium. Below this 
level, after ternary eutectic reaction, the three solid 
phases adjust their composition in accordance with 
their respective solvus lines. A vertical section which 
includes all alloys containing about 30% component 
C is shown in Figure 3.33. The three-phase plateau is 
immediately apparent. This section also intersects three 
of the binary eutectic reaction ‘beaks’, i.e. Liquid + 
a + p, Liquid + a + y and Liquid + p + y. 

We will consider the solidification of four alloys 
which are superimposed on the T 6 isothermal section 
of Figure 3.32. The simplest case of solidification is 
the liquid ternary alloy W, which transforms to three 
phases immediately below the plateau temperature T 6 . 
This type of alloy is the basis of fusible alloys which 
are used for special low m.p. applications (e.g. plugs 
for fire-extinguishing sprinkler systems). For example, 
a certain combination of lead (m.p. 327° C), tin (m.p. 
232°C) and bismuth (m.p. 269°C) melts at a temper- 
ature of 93 °C. On cooling, liquid alloy X will first 
decompose over a range of temperature to form a 
eutectic mixture (a + p) and then change in composi- 
tion along a ‘valley’ line until its residue is finally con- 
sumed in the ternary eutectic reaction. Liquid alloy Y 
will first deposit primary p and then, with rotation of 
(P + Liquid) tie-lines, become depleted in B until it 
reaches the nearby ‘valley’ line. Thereafter, it behaves 
like alloy X. In the special case of alloy Z, which lies 
on a construction line joining the ternary eutectic point 
to a comer of the ternary plateau, it will first form a pri- 
mary phase (y) and then undergo the ternary reaction: 
there will be no preliminary binary eutectic reaction. 

We will now outline the nature of two other four- 
phase equilibria. Figure 3.34a illustrates the Class II 
reaction, Liquid + a ^ p + y. As the temperature is 
lowered, two sets of descending tie-triangles repre- 
senting a peritectic-type reaction and a eutectic-type 
reaction, respectively, combine at the ternary reaction 
isotherm to form a kite- shaped plane. This plane then 
divides, as shown schematically in Figure 3.35a, form- 
ing a solid three-phase plateau and initiating a descend- 
ing set of eutectic triangles. Unlike Class I equilibrium, 
the composition which reacts with a-phase lies outside 
the limits of the top of the plateau. 

From Figure 3.34b, which illustrates Class III 
equilibrium, Liquid + a + p ^ y, a solid three-phase 
plateau is again a central feature. At the higher 
temperatures, a eutectic-type reaction (Liquid -> a + 
P) generates a stack of tie-triangles. At the critical 
temperature of the ternary peritectic reaction, a large 
triangle breaks up and two sets of peritectic-type 
triangles are initiated (Figure 3.35b). If the Class II 
and Class III ternary reaction sequences are compared 
with that of Class I, it will be seen that there 
is an inverse relation between eutectic Class I and 
the peritectic Class III. Class II is intermediate in 
character to Classes I and III. 








70 Modern Physical Metallurgy and Materials Engineering 



Figure 3.33 Vertical section through ternary system shown 
in Figure 3.31. 

3.2. 9. 5 Application to dielectric ceramics 
The phase diagram for the Mg 0 -Al 2 03 -Si 02 system 
(Figure 3.36) has proved extremely useful in providing 
guidance on firing strategies and optimum phase 
relations for important dielectric 1 ceramics. (The 
diagram is also relevant to basic steel-making 
refractories based on magnesia, i.e. periclase.) Its 
principal features are the straight join-lines, which 
link binary and/or ternary compounds, and the curving 

l A dielectric is a material that contains few or no free 
electrons and has a lower electrical conductivity than a 
metal. 



‘valley’ lines. Two junctions of these join-lines lie 
within the diagram, marking the ternary compounds 
sapphirine (M 4 A 5 S 2 ) and cordierite (M 2 A 2 S 5 ). The 
join-lines divide the projection into tie-triangles 
(sometimes termed compatibility triangles). These 
triangles enable the amounts and composition of stable 
phases to be calculated. The topology of the liquidus 
surface is always of prime interest. In this system, 
the lowest liquidus temperature (1345°C) is associated 
with the tridymite-protoenstatite- cordierite eutectic. 

In the complementary Figure 3.37, compositional 
zones for four classes of dielectric ceramic have been 
superimposed, i.e. forsterite ceramics, low-loss-factor 
steatites, steatite porcelains and cordierite ceramics. 
These fired ceramics originate from readily-workable 
clay/talc mixtures. The nominal formula for talc is 
Mg 3 (Si 2 0 5 ) 2 (0H) 2 . Fired clay can be regarded as mul- 
lite (A 3 S 2 ) plus silica, and fired talc as protoenstatite 
(MS) plus silica: accordingly, the zones are located 
toward the silica-rich comer of the diagram. Each fired 
product consists of small crystals in a glassy matrix 
(20-30%). The amount of glass must be closely con- 
trolled. Ideally, control of firing is facilitated when the 
amount of glass-forming liquid phase changes slowly 
with changing temperature. In this respect, the pres- 
ence of a steeply sloping liquidus is favourable (e.g. 
forsterite (Mg 2 Si 2 0 4 ) ceramics). Unfortunately, the 
other three materials tend to liquefy rather abruptly 
and to form too much liquid, making firing a poten- 
tially difficult operation. 



Figure 3.34 Phase diagrams illustrating (a) Class II equilibrium ( Liquid + a ^ f + y), and (b) Class III equilibrium 
( Liquid + a + (5 ^ y) (after Rhines, 1956). 
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Figure 3.35 Class II and Class III equilibria in ternary systems. 
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Figure 3.36 Basal projection for MgO-Al 2 O 3 -Si 02 system ; regions of solid solution not shown (from Keith and Schairer, 
1952; by permission of University of Chicago Press). 
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A3S 2 = mullite 



Figure 3.37 Location of steatites, cordierite and forsterite in Figure 3.36 (after King ery, Bowen and Uhlmann, 1976; by 
permission of Wiley-lnterscience ) . 


Let us consider four representative compositions A, 
B, C and D in more detail (Figure 3.37). The steatite A 
is produced from a 90% talc -10% clay mixture. Pure 
talc liquefies very abruptly and clay is added to mod- 
ify this undesirable feature; even then firing conditions 
are critical. Additional magnesia is used in low-loss 
steatites (B) for the same purpose but, again, firing is 
difficult to control. During cooling after firing, protoen- 
statite converts to clinoenstatite: small crystals of the 
latter are embedded in a glassy matrix. Low-loss-factor 
steatites also have a relatively high dielectric constant 
and are widely used for high-frequency insulators. 
Cordierite ceramics (C) are thermally shock-resistant, 
having a low coefficient of thermal expansion. Despite 
their restricted firing range, they find use as electrical 
resistor supports and burner tips. Fluxes can be used 


to extend the freezing range of steatites and cordierites 
but the electrical properties are likely to suffer. In sharp 
contrast, forsterite ceramics (D), which are also suit- 
able for high-frequency insulation, have a conveniently 
wide firing range. 

When considering the application of phase diagrams 
to ceramics in general, it must be recognized that 
ceramic structures are usually complex in character. 
Raw materials often contain trace impurities which will 
shift boundaries in phase diagrams and influence rates 
of transformation. Furthermore, metastable glass for- 
mation is quite common. Determination of the actual 
phase diagrams is difficult and time-consuming; con- 
sequently, experimental work often focuses upon a 
specific problem or part of a system. Against this back- 
ground, in circumstances where detailed information 
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on phases is sought, it is advisable to refer back to 
the experimental conditions and data upon which the 
relevant phase diagram are based. 

3.3 Principles of alloy theory 

3.3.1 Primary substitutional solid solutions 

3 . 3 . 1 . 1 The Hume-Rothery rules 

The key phase diagrams outlined in Section 3.2.8 
exhibit many common features (e.g. primary solid 
solutions, intermediate phases) and for systems based 
on simple metals some general rules 1 governing the 
formation of alloys have been formulated. These 
rules can form a useful basis for predicting alloying 
behaviour in other more complex systems. 

In brief the rules for primary solid solubility are as 
follows: 

1. Atomic size factor — If the atomic diameter of the 
solute atom differs by more than 15% from that 
of the solvent atom, the extent of the primary 
solid solution is small. In such cases it is said that 
the size-factor is unfavourable for extensive solid 
solution. 

1 These are usually called the Hume-Rothery rules because it 
was chiefly W. Hume-Rothery and his colleagues who 
formulated them. 
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Figure 3.38 Effect of size factor on the form of the equilibrium 
(c) Au-Ni, and (d) Cu-Ag. 


2. Electrochemical effect — The more electropositive 
the one component and the more electronegative 
the other, the greater is the tendency for the two 
elements to form compounds rather than extensive 
solid solutions. 

3. Relative valency effect — A metal of higher valency 
is more likely to dissolve to a large extent in one 
of lower valency than vice versa. 

3.3. 1.2 Size-factor effect 

Two metals are able to form a continuous range of solid 
solutions only if they have the same crystal structure 
(e.g. copper and nickel). However, even when the 
crystal structure of the two elements is the same, the 
extent of the primary solubility is limited if the atomic 
size of the two metals, usually taken as the closest 
distance of approach of atoms in the crystal of the 
pure metal, is unfavourable. This is demonstrated in 
Figure 3.38 for alloy systems where rules 2 and 3 have 
been observed, i.e. the electrochemical properties of 
the two elements are similar and the solute is dissolved 
in a metal of lower valency. As the size difference 
between the atoms of the two component metals A and 
B approaches 15%, the equilibrium diagram changes 
from that of the copper-nickel type to one of a eutectic 
system with limited primary solid solubility. 

The size-factor effect is due to the distortion 
produced in the parent lattice around the dissolved 
misfitting solute atom. In these localized regions the 
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(d) 

diagram; examples include (a) Cu-Ni, Au-Pt, (b) Ni-Pt, 
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interatomic distance will differ from that given by the 
minimum in the E-r curve of Figure 6.2, so that the 
internal energy and hence the free energy, G, of the 
system is raised. In the limit when the lattice distortion 
is greater than some critical value the primary solid 
solution becomes thermodynamically unstable relative 
to some other phase. 

3.3. 1.3 Electrochemical effect 
This effect is best demonstrated by reference to the 
alloying behaviour of an electropositive solvent with 
solutes of increasing electronegativity. The electroneg- 
ativity of elements in the Periodic Table increases from 
left to right in any period and from bottom to top in any 
group. Thus, if magnesium is alloyed with elements 
of Group IV the compounds formed, Mg 2 (Si, Sn or 
Pb), become more stable in the order lead, tin, silicon, 
as shown by their melting points, 550°C, 778°C and 
1085°C, respectively. In accordance with rule 2 the 
extent of the primary solid solution is small (~7.75 
at.%, 3.35 at.%, and negligible, respectively, at the 
eutectic temperature) and also decreases in the order 
lead, tin, silicon. Similar effects are also observed with 
elements of Group V, which includes the elements bis- 
muth, antimony and arsenic, when the compounds Mg 3 
(Bi, Sb or As) 2 are formed. 

The importance of compound formation in control- 
ling the extent of the primary solid solution can be 
appreciated by reference to Figure 3.39, where the 
curves represent the free-energy versus composition 
relationship between the a-phase and compound at 
a temperature T. It is clear from Figure 3.39a that 
at this temperature the o'-phase is stable up to a 
composition c \ , above which the phase mixture ( a + 
compound) has the lower free energy. When the com- 
pound becomes more stable, as shown in Figure 3.39b, 
the solid solubility decreases, and correspondingly the 
phase mixture is now stable over a greater composition 
range which extends from c 3 to c 4 . 
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The above example is an illustration of a more gen- 
eral principle that the solubility of a phase decreases 
with increasing stability, and may also be used to show 
that the concentration of solute in solution increases 
as the radius of curvature of the precipitate particle 
decreases. Small precipitate particles are less stable 
than large particles and the variation of solubility with 
particle size is recognized in classical thermodynamics 
by the Thomson -Freundlich equation 

ln[c(r)/c] = 2 yQ/kTr (3.7) 

where c(r) is the concentration of solute in equilibrium 
with small particles of radius r, c the equilibrium con- 
centration, y the precipitate/matrix interfacial energy 
and £2 the atomic volume (see Chapter 8). 

3.3. 1.4 Relative valency effect 

This is a general rule for alloys of the univalent metals, 
copper, silver and gold, with those of higher valency. 
Thus, for example, copper will dissolve approximately 
40% zinc in solid solution but the solution of copper 
in zinc is limited. For solvent elements of higher 
valencies the application is not so general, and in fact 
exceptions, such as that exhibited by the magnesium- 
indium system, occur. 

3. 3. 1.5 The primary solid solubility boundary 

It is not yet possible to predict the exact form of the 
o'-solid solubility boundary, but in general terms the 
boundary may be such that the range of primary solid 
solution either (1) increases or (2) decreases with rise 
in temperature. Both forms arise as a result of the 
increase in entropy which occurs when solute atoms 
are added to a solvent. It will be remembered that this 
entropy of mixing is a measure of the extra disorder 
of the solution compared with the pure metal. 

The most common form of phase boundary is 
that indicating that the solution of one metal in 
another increases with rise in temperature. This follows 
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Figure 3.39 Influence of compound stability on the solubility limit of the a phase at a given temperature. 
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from thermodynamic reasoning since increasing the 
temperature favours the structure of highest entropy 
(because of the —TS term in the relation G = H — TS) 
and in alloy systems of the simple eutectic type an 
o'-solid solution has a higher entropy than a phase 
mixture (a + /3). Thus, if the alloy exists as a phase 
mixture (a + /3) at the lower temperatures, it does 
so because the value of H happens to be less for 
the mixture than for the homogeneous solution at 
that composition. However, because of its greater 
entropy term, the solution gradually becomes preferred 
at high temperatures. In more complex alloy systems, 
particularly those containing intermediate phases of 
the secondary solid solution type (e.g. copper- zinc, 
copper- gallium, copper- aluminium, etc.), the range 
of primary solid solution decreases with rise in 
temperature. This is because the /3-phase, like the a- 
phase, is a disordered solid solution. However, since it 
occurs at a higher composition, it has a higher entropy 
of mixing, and consequently its free energy will fall 
more rapidly with rise in temperature. This is shown 
schematically in Figure 3.40. The point of contact on 
the free energy curve of the a-phase, determined by 
drawing the common tangent to the a and /3 curves, 
governs the solubility c at a given temperature T. The 
steep fall with temperature of this common tangent 
automatically gives rise to a decreasing solubility limit. 

Many alloys of copper or silver reach the limit of 
solubility at an electron to atom ratio of about 1.4. 
The divalent elements zinc, cadmium and mercury 


have solubilities of approximately 40 at.% 1 (e.g. 
copper-zinc, silver- cadmium, silver -mercury), the 
trivalent elements approximately 20 at.% (e.g. cop- 
per-aluminium, copper- gallium, silver -aluminium, 
silver -indium) and the tetravalent elements about 
13 at.% (e.g. copper- germanium, copper- silicon, sil- 
ver-tin), respectively. 

The limit of solubility has been explained by Jones 
in terms of the Brillouin zone structure (see Chapter 6). 
It is assumed that the density of states -energy curve 
for the two phases, a (the close-packed phase) and 
^ (the more open phase), is of the form shown in 
Figure 3.41, where the N(E) curve deviates from the 
parabolic relationship as the Fermi surface approaches 
the zone boundary. 2 As the solute is added to the 
solvent lattice and more electrons are added the top 
of the Fermi level moves towards A, i.e. where the 

x For example, a copper-zinc alloy containing 40 at.% zinc 
has an e/a ratio of 1.4, i.e. for every 100 atoms, 60 are 
copper each contributing one valency electron and 40 are 
zinc each contributing two valency electrons, so that 
e/a = (60 x 1 + 40 x 2)/100 = 1.4. 

2 The shape of the Fermi surface may be determined from 
measurements of physical properties as a function of 
orientation in a single crystal. The surface resistance to a 
high-frequency current at low temperatures (the anomalous 
skin effect) shows that in copper the Fermi surface is 
distorted from the spherical shape but becomes more nearly 
spherical in copper alloys. 
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Figure 3.40 (a) The effect of temperature on the relative positions of the a- and f> -phase free energy curves for an alloy system 
having a primary solid solubility of the form shown in (b). 
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Figure 3.41 Density of states versus energy diagram. 




Figure 3.42 (a) BCC lattice showing the relative positions 
of the main lattice sites, the octahedral interstices marked O, 
and the tetrahedral interstices marked T. (b) Structure cell 
of iron showing the distortions produced by the two different 
interstitial sites. Only three of the iron atoms surrounding 
the octahedral sites are shown; the fourth, centred at A, has 
been omitted for clarity ( after Williamson and Smallman, 
1953). 


density of states is high and the total energy E for a 
given electron concentration is low. Above this point 
the number of available energy levels decreases so 
markedly that the introduction of a few more electrons 
per atom causes a sharp increase in energy. Thus, 
just above this critical point the a structure becomes 
unstable relative to the alternative /3 structure which 
can accommodate the electrons within a smaller energy 
range, i.e. the energy of the Fermi level is lower if 
the /3-phase curve is followed rather than the a-phase 
curve. The composition for which E max reaches the 
point E a is therefore a critical one, since the alloy will 
adopt that phase which has the lowest energy. It can 
be shown that this point corresponds to an electron-to- 
atom ratio of approximately 1.4. 

3.3.2 Interstitial solid solutions 

Interstitial solid solutions are formed when the solute 
atoms can fit into the interstices of the lattice of the sol- 
vent. However, an examination of the common crystal 
lattices shows that the size of the available interstices is 
restricted, and consequently only the small atoms, such 
as hydrogen, boron, carbon or nitrogen, with atomic 
radii very much less than one nanometre form such 
solutions. The most common examples occur in the 
transition elements and in particular the solution of 
carbon or nitrogen in iron is of great practical impor- 
tance. In fee iron (austenite) the largest interstice or 
‘hole’ is at the centre of the unit cell (coordinates 
\) where there is space for an atom of radius 52 pm, 
i.e. 0.4 lr if r is the radius of the solvent atom. A 
carbon atom (80 pm (0.8 A) diameter) or a nitrogen 
atom (70 pm diameter) therefore expands the lattice 
on solution, but nevertheless dissolves in quantities 
up to 1.7 wt% and 2.8 wt%, respectively. Although 
the bcc lattice is the more open structure the largest 
interstice is smaller than that in the fee. In bcc iron 
(ferrite) the largest hole is at the position (^, 0) 

and is a tetrahedral site where four iron atoms are 


situated symmetrically around it; this can accommo- 
date an atom of radius 36 pm, i.e. 0.29r, as shown 
in Figure 3.42a. However, internal friction and X-ray 
diffraction experiments show that the carbon or nitro- 
gen atoms do not use this site, but instead occupy 
a smaller site which can accommodate an atom only 
0.154r, or 19 pm. This position (0, 0, \) at the mid- 
points of the cell edges is known as the octahedral site 
since, as can be seen from Figure 3.42b, it has a dis- 
torted octahedral symmetry for which two of the iron 
atoms are nearer to the centre of the site than the other 
four nearest neighbours. The reason for the interstitial 
atoms preferring this small site is thought to be due to 
the elastic properties of the bcc lattice. The two iron 
atoms which lie above and below the interstice, and 
which are responsible for the smallness of the hole, 
can be pushed away more easily than the four atoms 
around the larger interstice. As a result, the solution 
of carbon in of-iron is extremely limited (0.02 wt%) 
and the structure becomes distorted into a body-centred 
tetragonal lattice. The c axis for each interstitial site 
is, however, disordered, so that this gives rise to a 
structure which is statistically cubic. The body-centred 
tetragonal structure forms the basis of martensite (an 
extremely hard metastable constituent of steel), since 
the quenching treatment given to steel retains the car- 
bon in supersaturated solution (see Chapter 8). 

3.3.3 Types of intermediate phases 

3.3.3. 1 Electrochemical compounds 
The phases which form in the intermediate compo- 
sition regions of the equilibrium diagram may be 
(1) electrochemical or full-zone compounds, (2) size- 
factor compounds, or (3) electron compounds. The 
term ‘compound’ still persists even though many of 
these phases do not obey the valency laws of chemistry 
and often exist over a wide composition range. 

We have already seen that a strong tendency 
for compound formation exists when one element 
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is electropositive and the other is electronegative. 
The magnesium-based compounds are probably 
the most common examples having the formula 
Mg 2 (Pb, Sn, Ge or Si). These have many features 
in common with salt-like compounds since their 
compositions satisfy the chemical valency laws, their 
range of solubility is small, and usually they have 
high melting points. Moreover, many of these types of 
compounds have crystal structures identical to definite 
chemical compounds such as sodium chloride, NaCl, 
or calcium fluoride, CaF 2 . In this respect the Mg 2 X 
series are anti-isomorphous with the CaF 2 fluorspar 
structure, i.e. the magnesium metal atoms are in the 
position of the non-metallic fluoride atoms and the 
metalloid atoms such as tin or silicon take up the 
position of the metal atoms in calcium fluoride. 

Even though these compounds obey all the chemical 
principles they may often be considered as special 
electron compounds. For example, the first Brillouin 
zone 1 of the CaF 2 structure is completely filled at 
| electrons per atom, which significantly is exactly 
that supplied by the compound Mg 2 Pb, Sn, . . ., etc. 
Justification for calling these full-zone compounds is 
also provided by electrical conductivity measurements. 
In contrast to the behaviour of salt-like compounds 
which exhibit low conductivity even in the liquid state, 
the compound Mg 2 Pb shows the normal conduction 
(which indicates the possibility of zone overlapping) 
while Mg 2 Sn behaves like a semiconductor (indicating 
that a small energy gap exists between the first and 
second Brillouin zones). 

In general, it is probable that both concepts are nec- 
essary to describe the complete situation. As we shall 
see in Section 3. 3. 3. 3, with increasing electrochemical 
factor even true electron compounds begin to show 
some of the properties associated with chemical com- 
pounds, and the atoms in the structure take up ordered 
arrangements. 

3. 3. 3. 2 Size-factor compounds 
When the atomic diameters of the two elements dif- 
fer only slightly, electron compounds are formed, as 
discussed in the next section. However, when the dif- 
ference in atomic diameter is appreciable, definite size- 
factor compounds are formed which may be of the 
(1) interstitial or (2) substitutional type. 

A consideration of several interstitial solid solutions 
has shown that if the interstitial atom has an atomic 
radius 0.41 times that of the metal atom then it can fit 
into the largest available structural interstice without 
distortion. When the ratio of the radius of the intersti- 
tial atom to that of the metal atom is greater than 0.41 
but less than 0.59, interstitial compounds are formed; 
hydrides, borides, carbides and nitrides of the transi- 
tion metals are common examples. These compounds 
usually take up a simple structure of either the cubic 

brillouin zones and electrical conductivity are dealt with in 
Chapter 6. 


or hexagonal type, with the metal atoms occupying 
the normal lattice sites and the non-metal atoms the 
interstices. In general, the phases occur over a range 
of composition which is often centred about a simple 
formula such as M 2 X and MX. Common examples are 
carbides and nitrides of titanium, zirconium, hafnium, 
vanadium, niobium and tantalum, all of which crys- 
tallize in the NaCl structure. It is clear, therefore, that 
these phases do not form merely as a result of the 
small atom fitting into the interstices of the solvent 
structure, since vanadium, niobium and tantalum are 
bcc, while titanium, zirconium and hafnium are cph. 
By changing their structure to fee the transition metals 
allow the interstitial atom not only a larger ‘hole’ but 
also six metallic neighbours. The formation of bonds 
in three directions at right angles, such as occurs in 
the sodium chloride arrangement, imparts a condition 
of great stability to these MX carbides. 

When the ratio r (interstitial ) to r (met a p exceeds 0.59 
the distortion becomes appreciable, and consequently 
more complicated crystal structures are formed. Thus, 
iron nitride, where r N /r Fe = 0.56, takes up a structure 
in which nitrogen lies at the centre of six atoms as 
suggested above, while iron carbide, i.e. cementite, 
Fe 3 C, for which the ratio is 0.63, takes up a more 
complex structure. 

For intermediate atomic size difference, i.e. about 
20-30%, an efficient packing of the atoms can be 
achieved if the crystal structure common to the Laves 
phases is adopted (Table 3.2). These phases, classified 
by Laves and his co-workers, have the formula AB 2 
and each A atom has 12 B neighbours and 4 A neigh- 
bours, while each B atom is surrounded by six like 
and six unlike atoms. The average coordination num- 
ber of the structure (13.33) is higher, therefore, than 
that achieved by the packing of atoms of equal size. 
These phases crystallize in one of three closely related 
structures which are isomorphous with the compounds 
MgCu 2 (cubic), MgNi 2 (hexagonal) or MgZn 2 (hexag- 
onal). The secret of the close relationship between 
these structures is that the small atoms are arranged 
on a space lattice of tetrahedra. 

The different ways of joining such tetrahedra 
account for the different structures. This may be 
demonstrated by an examination of the MgCu 2 
structure. The small B atoms lie at the comers of 
tetrahedra which are joined point-to-point throughout 
space, as shown in Ligure 3.43a. Such an arrangement 
provides large holes of the type shown in Ligure 3.43b 
and these are best filled when the atomic ratio 
Aiar g e)/Asmaii) = 1.225. The complete cubic stmeture 
of MgCu 2 is shown in Ligure 3.43c. The MgZn 2 
stmeture is hexagonal, and in this case the tetrahedra 
are joined alternately point-to-point and base-to-base 
in long chains to form a wurtzite type of stmeture. 
The MgNi 2 structure is also hexagonal and although 
very complex it is essentially a mixture of both the 
MgCu 2 and MgNi 2 types. 

The range of homogeneity of these phases is narrow. 
This limited range of homogeneity is not due to any 
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Figure 3.43 (a) Framework of the MgCu 2 structure, (b) Shape of hole in which large Mg atom is accommodated. 

(c) Complete MgCu 2 structure (after Hume -Rothery, Smallman and Howorth, 1969; by courtesy of the Institute of Metals). 


Table 3.2 Compounds which exist in a Laves phase structure 


MgCu 2 type 

MgNi 2 type 

MgZn 2 type 

AgBe 2 

BiAu2 

BaMg 2 

> 


NbCo2 

Nb(Mn of Fe) 2 

NbCo 2 1 

I with 

TaCc >2 

TaMn2 

TaCo 2 1 

l excess of 

Ti (Be, Co, or Cr )2 

Ti (Mn or Fe )2 

TiCo 2 

| B metal 

U (Al, Co, Fe or Mn) 2 

UNi 2 

ZrFe 2 J 

1 

Zr (Co, Fe, or W) 2 

Zr (Cr, Ir, Mn, Re, Ru, Os or V )2 




ionic nature of the compound, since ionic compounds 
usually have low coordination numbers whereas. Laves 
phases have high coordination numbers, but because 
of the stringent geometrical conditions governing the 
structure. However, even though the chief reason for 
their existence is that the ratio of the radius of the 
large atom to that of the small is about 1.2, there are 
indications that electronic factors may play some small 
part. For example, provided the initial size-factor con- 
dition is satisfied then if the e/a ratio is high (e.g. 
2), there is a tendency for compounds to crystallize 
in the MgZn 2 structure, while if the e/a ratio is low 
(e.g. |), then there is a tendency for the MgCu 2 type 
of structure to be formed. This electronic feature is 
demonstrated in the magnesium -nickel -zinc ternary 
system. Thus, even though the binary systems contain 
both the MgZn 2 and MgNi 2 phases the ternary com- 
pound MgNiZn has the MgCu 2 structure, presumably 
because its e/a ratio is |. Table 3.2 shows a few com- 
mon examples of each type of Laves structure, from 
which it is evident that there is also a general tendency 
for transition metals to be involved. 

3. 3. 3. 3 Electron compounds 
Alloys of copper, silver and gold with the B subgroup 
all possess the sequence a, /3, y, e of structurally 
similar phases, and while each phase does not occur at 
the same composition when this is measured in weight 
per cent or atomic per cent, they do so if composition 


is expressed in terms of electron concentration. Hume- 
Rothery and his co-workers have pointed out that 
the e/a ratio is important not only in governing the 
limit of the o'- solid solution but also in controlling 
the formation of certain intermediate phases; for this 
reason they have been termed ‘electron compounds’. 

In terms of those phases observed in the cop- 
per-zinc system (Figure 3.20), /3-phases are found at 
an e/a ratio of \ and these phases are often either 
disordered bcc in structure or ordered CsCl-type, p'. 
In the copper- aluminium system for example, the p- 
structure is found at Cu 3 A1, where the three valency 
electrons from the aluminium and the one from each 
copper atom make up a ratio of 6 electrons to 4 atoms, 
i.e. e/a = |. Similarly, in the copper-tin system the 
/3-phase occurs at CusSn with 9 electrons to 6 atoms 
giving the governing e/a ratio. The y -brass phase, 
Cu 5 Zn 8 , has a complex cubic (52 atoms per unit cell) 
structure, and is characterized by an e/a ratio of 
while the £ -brass phase, CuZn 3 , has a cph structure and 
is governed by an e/a ratio of \. A list of some of these 
structurally-analogous phases is given in Table 3.3. 

A close examination of this table shows that some 
of these phases, e.g. CusSi and Ag 3 Al, exist in dif- 
ferent structural forms for the same e/a ratio. Thus, 
Ag 3 Al is basically a | bcc phase, but it only exists 
as such at high temperatures; at intermediate temper- 
atures it is cph and at low temperatures /3-Mn. It is 
also noticeable that to conform with the appropriate 
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Table 3.3 Some selected structurally-analogous phases 


Electron-atom ratio 3:2 


Electron-atom ratio 21:13 
y -brass (complex cubic) 

Electron-atom 
ratio 7:4 

(3-brass (bcc) 

f-manganese 
( complex cubic ) 

(cph) 

s -brass (cph) 

(Cu, Ag or Au)Zn 
CuBe 

(Ag or Au) 3 Al 
CusSi 

AgZn 

AgCd 

(Cu, Ag or Au) (Zn or Cd)g 

CU9AI4 

(Cu, Ag or Au) (Zn or Cd) 3 

Cu 3 Sn 

(Ag or Au)Mg 

CoZn 3 

Ag 3 Al 


Cu 3 Si 

(Ag or Au)Cd 
(Cu or Ag) 3 Al 
(CusSn or Si) 

(Fe, Co or Ni)Al 


AusSn 

Cu 3 iSn 8 

(Fe, Co, Ni, Pd or Pt)5Zn 2 i 

Ag 5 Al 3 


electron-to-atom ratio the transition metals are cred- 
ited with zero valency. The basis for this may be found 
in their electronic structure which is characterized by 
an incomplete d - band below an occupied outermost s- 
band. The nickel atom, for example, has an electronic 
structure denoted by (2) (8) (16) (2), i.e. two elec- 
trons in the first quantum shell, eight in the second, 
sixteen in the third and two in the fourth shells, and 
while this indicates that the free atom has two valency 
electrons, it also shows two electrons missing from the 
third quantum shell. Thus, if nickel contributes valency 
electrons, it also absorbs an equal number from other 
atoms to fill up the third quantum shell so that the net 
effect is zero. 

Without doubt the electron concentration is the 
most important single factor which governs these 
compounds. However, as for the other intermediate 
phases, a closer examination shows an interplay of all 
factors. Thus, in general, the bcc \ compounds are only 
formed if the size factor is less than ±18%, an increase 
in the valency of the solute tends to favour cph and 
/3-Mn structures at the expense of the bcc structure, a 
high electrochemical factor leads to ordering up to the 
melting point, and an increase in temperature favours 
the bcc structure in preference to the cph or /3-Mn 
structure. 

3.3.4 Order -disorder phenomena 

A substitutional solid solution can be one of two 
types, either ordered in which the A and B atoms 
are arranged in a regular pattern, or disordered in 
which the distribution of the A and B atoms is random. 
From the previous section it is clear that the necessary 
condition for the formation of a superlattice, i.e. an 
ordered solid solution, is that dissimilar atoms must 
attract each other more than similar atoms. In addition, 
the alloy must exist at or near a composition which can 
be expressed by a simple formula such as AB, A 3 B or 
AB 3 . The following are common structures: 

1. CuZn While the disordered solution is bcc with 
equal probabilities of having copper or zinc atoms 
at each lattice point, the ordered lattice has copper 


atoms and zinc atoms segregated to cube comers 
(0, 0, 0) and centres (|, ^), respectively. The 

superlattice in the /3-phase therefore takes up the 
CsCl (also described as B2 or L2 0 ) stmcture as 
illustrated in Figure 3.44a. Other examples of the 
same type, which may be considered as being made 
up of two interpenetrating simple cubic lattices, are 
Ag(Mg, Zn or Cd), AuNi, NiAl, FeAl and FeCo. 

2. Cu 3 Au This structure, which occurs less frequently 
than the /3-brass type, is based on the fee structure 
with copper atoms at the centres of the faces (0, 

\) and gold atoms at the corners (0, 0, 0), as shown 
in Figure 3.44b. Other examples of the Ll 2 stmcture 
include Ni 3 Al, Ni 3 Ti, Ni 3 Si, Pt 3 Al, Fe 3 Ge, Zr 3 Al. 

3. AuCu The AuCu structure shown in Figure 3.44c is 
also based on the fee lattice, but in this case alternate 
(001) layers are made up of copper and gold atoms, 
respectively. Hence, because the atomic sizes of 
copper and gold differ, the lattice is distorted into 
a tetragonal structure having an axial ratio c/a = 
0.93. Other examples of the Ll 0 include CoPt, FePt 
and TiAl. 

4. Fe 3 Al Like FeAl, the Fe 3 Al structure is based on 
the bcc lattice but, as shown in Figure 3.44d, eight 
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Figure 3.44 Examples of ordered structures, (a) CuZn, 
(b) Cu 3 Au, (c) CuAu, (d) Fe 3 Al. 
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simple cells are necessary to describe the complete 
ordered arrangement. In this structure any individual 
atom is surrounded by the maximum number of 
unlike atoms and the aluminium atoms are arranged 
tetrahedrally in the cell. Other examples of the D0 3 
include Fe 3 Si and Cu 3 A1. 

5. Mg 3 Cd This ordered structure is based on the cph 
lattice. Other examples of the DO 19 structure are 
Ti 3 Al, MgCd 3 and Ni 3 Sn. 

Another important structure which occurs in certain 
intermetallics is the defect lattice. In the compound 
NiAl, as the composition deviates from stoichiome- 
try towards pure aluminium, the electron to atom ratio 
becomes greater than |, but to prevent the compound 
becoming unstable the lattice takes up a certain propor- 
tion of vacancies to maintain the number of electrons 
per unit cell at a constant value of 3. Such defects obvi- 
ously increase the entropy of the alloy, but the fact that 
these phases are stable at low temperatures, where the 
entropy factor is unimportant, demonstrates that their 
stability is due to a lowering of internal energy. Such 
defects produce an anomalous decrease in both the lat- 
tice parameter and the density above 50 at.% Al. 

3.4 The mechanism of phase changes 

3.4.1 Kinetic considerations 

Changes of phase in the solid state involve a redistri- 
bution of the atoms in that solid and the kinetics of 
the change necessarily depend upon the rate of atomic 
migration. The transport of atoms through the crystal 
is more generally termed diffusion, and is dealt with 
in Section 6.4. This can occur more easily with the 
aid of vacancies, since the basic act of diffusion is the 
movement of an atom to an empty adjacent atomic 
site. 

Let us consider that during a phase change an 
atom is moved from an a-phase lattice site to a more 
favourable /3-phase lattice site. The energy of the atom 
should vary with distance as shown in Figure 3.45, 
where the potential barrier which has to be overcome 
arises from the interatomic forces between the mov- 
ing atom and the group of atoms which adjoin it and 
the new site. Only those atoms ( n ) with an energy 



Figure 3.45 Energy barrier separating structural states. 


greater than Q are able to make the jump, where 
Qa^p = H m - H a and Qp^ a =H m -Hp are the activation 
enthalpies for heating and cooling, respectively. The 
probability of an atom having sufficient energy to jump 
the barrier is given, from the Maxwell -Boltzmann dis- 
tribution law, as proportional to exp [—Q/kT] where 
k is Boltzmann’s constant, T is the temperature and Q 
is usually expressed as the energy per atom in electron 
volts . 1 

The rate of reaction is given by 

Rate = A exp [—Q/kT] (3.8) 

where A is a constant involving n and v, the frequency 
of vibration. To determine Q experimentally, the reac- 
tion velocity is measured at different temperatures and, 
since 

In (Rate) = In A - Q/kT (3.9) 

the slope of the In (rate) versus 1 /T curve gives Q/k. 

In deriving equation (3.8), usually called an 
Arrhenius equation after the Swedish chemist who 
first studied reaction kinetics, no account is taken of 
the entropy of activation, i.e. the change in entropy 
as a result of the transition. In considering a general 
reaction the probability expression should be written 
in terms of the free energy of activation per atom F 
or G rather than just the internal energy or enthalpy. 
The rate equation then becomes 

Rate = A exp [—F/kT] 

= A exp [S/k] exp [-E/kT] (3.10) 

The slope of the In (rate) versus 1 /T curve then gives 
the temperature-dependence of the reaction rate, which 
is governed by the activation energy or enthalpy, and 
the magnitude of the intercept on the In (rate) axis 
depends on the temperature-independent terms and 
include the frequency factor and the entropy term. 

During the transformation it is not necessary for 
the entire system to go from a to /3 at one jump 
and, in fact, if this were necessary, phase changes 
would practically never occur. Instead, most phase 
changes occur by a process of nucleation and growth 
(cf. solidification, Section 3.1.1). Chance thermal fluc- 
tuations provide a small number of atoms with suffi- 
cient activation energy to break away from the matrix 
(the old structure) and form a small nucleus of the 
new phase, which then grows at the expense of the 
matrix until the whole structure is transformed. By this 
mechanism, the amount of material in the intermedi- 
ate configuration of higher free energy is kept to a 
minimum, as it is localized into atomically thin lay- 
ers at the interface between the phases. Because of 

! Q may also be given as the energy in J mol -1 in which 
case the rate equation becomes 

Rate of reaction = A exp [—Q/KT] 
where R = k N is the gas constant, i.e. 8.314 J mol -1 K _1 . 
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this mechanism of transformation, the factors which 
determine the rate of phase change are: (1) the rate 
of nucleation, N (i.e. the number of nuclei formed in 
unit volume in unit time) and (2) the rate of growth, 
G (i.e. the rate of increase in radius with time). Both 
processes require activation energies, which in general 
are not equal, but the values are much smaller than 
that needed to change the whole structure from a to p 
in one operation. 

Even with such an economical process as nucleation 
and growth transformation, difficulties occur and it is 
common to find that the transformation temperature, 
even under the best experimental conditions, is slightly 
higher on heating than on cooling. This sluggishness 
of the transformation is known as hysteresis, and is 
attributed to the difficulties of nucleation, since dif- 
fusion, which controls the growth process, is usually 
high at temperatures near the transformation tempera- 
ture and is, therefore, not rate-controlling. Perhaps the 
simplest phase change to indicate this is the solidifica- 
tion of a liquid metal. 

The transformation temperature, as shown on the 
equilibrium diagram, represents the point at which the 
free energy of the solid phase is equal to that of the 
liquid phase. Thus, we may consider the transition, as 
given in a phase diagram, to occur when the bulk or 
chemical free energy change, A G v , is infinitesimally 
small and negative, i.e. when a small but positive driv- 
ing force exists. However, such a definition ignores the 
process whereby the bulk liquid is transformed to bulk 
solid, i.e. nucleation and growth. When the nucleus is 
formed the atoms which make up the interface between 
the new and old phase occupy positions of compromise 
between the old and new structure, and as a result 
these atoms have rather higher energies than the other 
atoms. Thus, there will always be a positive free energy 
term opposing the transformation as a result of the 
energy required to create the surface of interface. Con- 
sequently, the transformation will occur only when the 
sum AG V + AG S becomes negative, where A G s arises 
from the surface energy of solid -liquid interface. Nor- 
mally, for the bulk phase change, the number of atoms 
which form the interface is small and A G s compared 
with A G v can be ignored. However, during nucleation 
A G v is small, since it is proportional to the amount 


transformed, and A G s , the extra free energy of the 
boundary atoms, becomes important due to the large 
surface area to volume ratio of small nuclei. Therefore 
before transformation can take place the negative term 
A G v must be greater than the positive term A G s and, 
since A G v is zero at the equilibrium freezing point, it 
follows that undercooling must result. 

3.4.2 Homogeneous nucleation 

Quantitatively, since A G v depends on the volume of 
the nucleus and A G s is proportional to its surface area, 
we can write for a spherical nucleus of radius r 

AG = (4nr 3 AG v /3) + 4jzr 2 y (3.11) 

where A G v is the bulk free energy change involved 
in the formation of the nucleus of unit volume and 
y is the surface energy of unit area. When the 
nuclei are small the positive surface energy term 
predominates, while when they are large the negative 
volume term predominates, so that the change in free 
energy as a function of nucleus size is as shown in 
Figure 3.46a. This indicates that a critical nucleus size 
exists below which the free energy increases as the 
nucleus grows, and above which further growth can 
proceed with a lowering of free energy; AG max may 
be considered as the energy or work of nucleation W. 
Both r c and W may be calculated since d AG /dr = 
\7ir 2 AG v + 871 ry = 0 when r = r c and thus r c = 
—2y/AG v . Substituting for r c gives 

W = \6ny 3 / 3 AG 2 (3.12) 

The surface energy factor y is not strongly dependent 
on temperature, but the greater the degree of under- 
cooling or supersaturation, the greater is the release 
of chemical free energy and the smaller the critical 
nucleus size and energy of nucleation. This can be 
shown analytically since AG V = AH — T AS, and at 
T = T e , AG V = 0, so that AH = T e AS. It therefore 
follows that 

A G v = (T e - T)AS = AT AS 
and because A G v a AT, then 

W <x y 3 /AT 2 (3.13) 




Figure 3.46 (a) Effect of nucleus size on the free energy of nucleus formation, (b) Effect of undercooling on the rate of 
precipitation. 
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Consequently, since nuclei are formed by thermal fluc- 
tuations, the probability of forming a smaller nucleus is 
greatly improved, and the rate of nucleation increases 
according to 

Rate = A exp [—Q/kT] exp [— AG max /kT 

= A exp [— (Q + AG max )/kT] (3.14) 

The term exp [—Q/kT] is introduced to allow for 
the fact that rate of nucleus formation is in the limit 
controlled by the rate of atomic migration. Clearly, 
with very extensive degrees of undercooling, when 
AG max g, the rate of nucleation approaches exp 
[—Q/kT] and, because of the slowness of atomic 
mobility, this becomes small at low temperature 
(Figure 3.46b). While this range of conditions can 
be reached for liquid glasses the nucleation of liquid 
metals normally occurs at temperatures before this 
condition is reached. (By splat cooling, small droplets 
of the metal are cooled very rapidly (10 5 K s _1 ) and 
an amorphous solid may be produced.) Nevertheless, 
the principles are of importance in metallurgy since 
in the isothermal transformation of eutectoid steel, for 
example, the rate of transformation initially increases 
and then decreases with lowering of the transformation 
temperature ( see TTT curves, Chapter 8). 

3.4.3 Heterogeneous nucleation 

In practice, homogeneous nucleation rarely takes place 
and heterogeneous nucleation occurs either on the 
mould walls or on insoluble impurity particles. From 
equation (3.13) it is evident that a reduction in the 
interfacial energy y would facilitate nucleation at small 
values of AT. Figure 3.47 shows how this occurs at 
a mould wall or pre-existing solid particle, where the 
nucleus has the shape of a spherical cap to minimize 
the energy and the ‘wetting’ angle 0 is given by the 
balance of the interfacial tensions in the plane of the 
mould wall, i.e. cos6> = (y M l - KsmVKsl- 

The formation of the nucleus is associated with an 
excess free energy given by 

AG = V AG V + AslKsl + AsmKsm — AsmXml 
= 7t/3(2 — 3 cos 6 + cos 3 6)r 3 AG V 
+ 2n{\ - cos 0)r 2 ysh 

+ 7tr 2 sin 2 e(y S M - Klm ) (3.15) 

Differentiation of this expression for the maximum, i.e. 
dAG/dr = 0, gives r c = — 2y S h/AG v and 

W = (16ttk 3 /3AG„ 2 )[(1 - cos#) 2 (2 + cos#)/4] 

(3.16) 

or 



N 




Figure 3.47 Schematic geometry of heterogeneous 
nucleation. 

heterogeneous nucleation. When 0 = 180°, no wetting 
occurs and there is no reduction in W ; when 0^0° 
there is complete wetting and W -> 0; and when 
0 < 0 < 1 80° there is some wetting and W is reduced. 

3.4.4 Nucleation in solids 

When the transformation takes place in the solid state, 
i.e. between two solid phases, a second factor giving 
rise to hysteresis operates. The new phase usually 
has a different parameter and crystal structure from 
the old so that the transformation is accompanied by 
dimensional changes. However, the changes in volume 
and shape cannot occur freely because of the rigidity of 
the surrounding matrix, and elastic strains are induced. 
The strain energy and surface energy created by the 
nuclei of the new phase are positive contributions to 
the free energy and so tend to oppose the transition. 
The total free energy change is 

AG = V AG V + Ay + VAG S (3.17) 

where A is the area of interface between the two phases 
and y the interfacial energy per unit area, and A G s is 
the misfit strain energy per unit volume of new phase. 
For a spherical nucleus of the second phase 

AG = \nr\AG v - AGj) + 4 nr 2 y (3.18) 

and the misfit strain energy reduces the effective driv- 
ing force for the transformation. Differentiation of 
equation (3.18) gives 

r c = -2y/(AG v - AG,), and 


IF (heterogeneous) — W (homogeneous) [*^(^)J 

The shape factor S(6) < 1 is dependent on the value 
of 0 and the work of nucleation is therefore less for 


W = I6jry 3 /3(AG V - AG,) 2 

The value of y can vary widely from a few mJ/m 2 
to several hundred mJ/m 2 depending on the coherency 
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Figure 3.48 Schematic representation of interface structures, (a) A coherent boundary with misfit strain and (b) a 
semi-coherent boundary with misfit dislocations. 


of the interface. A coherent interface is formed when 
the two crystals have a good ‘match’ and the two lat- 
tices are continuous across the interface. This happens 
when the interfacial plane has the same atomic config- 
uration in both phases, e.g. {1 1 1} in fee and {0001} 
in cph. When the ‘match’ at the interface is not perfect 
it is still possible to maintain coherency by strain- 
ing one or both lattices, as shown in Figure 3.48a. 
These coherency strains increase the energy and for 
large misfits it becomes energetically more favourable 
to form a semi-coherent interface in which the mis- 
match is periodically taken up by misfit dislocations. 1 
The coherency strains can then be relieved by a cross- 
grid of dislocations in the interface plane, the spac- 
ing of which depends on the Burgers vector b of the 
dislocation and the misfit s , i.e. b/e. The interfacial 
energy for semi-coherent interfaces arises from the 
change in composition across the interface or chemical 
contribution as for fully-coherent interfaces, plus the 
energy of the dislocations (see Chapter 4). The energy 
of a semi-coherent interface is 200-500 mJ/m 2 and 
increases with decreasing dislocation spacing until the 
dislocation strain fields overlap. When this occurs, the 
discrete nature of the dislocations is lost and the inter- 
face becomes incoherent. The incoherent interface is 
somewhat similar to a high-angle grain boundary (see 
Figure 3.3) with its energy of 0.5 to 1 J/m 2 relatively 
independent of the orientation. 

The surface and strain energy effects discussed 
above play an important role in phase separation. 
When there is coherence in the atomic structure across 
the interface between precipitate and matrix the sur- 
face energy term is small, and it is the strain energy 
factor which controls the shape of the particle. A 
plate-shaped particle is associated with the least strain 
energy, while a spherical- shaped particle is associated 
with maximum strain energy but the minimum surface 
energy. On the other hand, surface energy determines 
the crystallographic plane of the matrix on which a 

l A detailed treatment of dislocations and other defects is 

given in Chapter 4. 


plate-like precipitate forms. Thus, the habit plane is 
the one which allows the planes at the interface to fit 
together with the minimum of disregistry; the frequent 
occurrence of the Widmanstatten structures may be 
explained on this basis. It is also observed that precip- 
itation occurs most readily in regions of the structure 
which are somewhat disarranged, e.g. at grain bound- 
aries, inclusions, dislocations or other positions of high 
residual stress caused by plastic deformation. Such 
regions have an unusually high free energy and neces- 
sarily are the first areas to become unstable during the 
transformation. Also, new phases can form there with 
a minimum increase in surface energy. This behaviour 
is considered again in Chapter 7. 
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Chapter 4 


Defects in solids 


4.1 Types of imperfection 

Real solids invariably contain structural discontinuities 
and localized regions of disorder. This heterogene- 
ity can exist on both microscopic and macroscopic 
scales, with defects or imperfections ranging in size 
from missing or misplaced atoms to features that are 
visible to the naked eye. The majority of materials 
used for engineering components and structures are 
made up from a large number of small interlocking 
grains or crystals. It is therefore immediately appro- 
priate to regard the grain boundary surfaces of such 
polycrystalline aggregates as a type of imperfection. 
Other relatively large defects, such as shrinkage pores, 
gas bubbles, inclusions of foreign matter and cracks, 
may be found dispersed throughout the grains of a 
metal or ceramic material. In general, however, these 
large-scale defects are very much influenced by the 
processing of the material and are less fundamental to 
the basic material. More attention will thus be given 
to the atomic-scale defects in materials. Within each 
grain, atoms are regularly arranged according to the 
basic crystal structure but a variety of imperfections, 
classified generally as crystal defects, may also occur. 
A schematic diagram of these basic defects is shown 
in Figure 4.1. These take the form of: 

• Point defects, such as vacant atomic sites (or simply 
vacancies) and interstitial atoms (or simply intersti- 
tials) where an atom sits in an interstice rather than 
a normal lattice site 

• Line defects, such as dislocations 

• Planar defects, such as stacking faults and twin 
boundaries 

• Volume defects, such as voids, gas bubbles and 
cavities. 

In the following sections this type of classification 
will be used to consider the defects which can occur 
in metallic and ceramic crystals. Glasses already lack 
long-range order; we will therefore concentrate upon 
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Figure 4.1 (a) Vacancy -interstitial, (b) dislocation, 
(c) stacking fault, (d) void. 


crystal defects. Defects in crystalline macromolecular 
structures, as found in polymers, form a special subject 
and will be dealt with separately in Section 4.6.7. 


4.2 Point defects 

4.2.1 Point defects in metals 

Of the various lattice defects the vacancy is the only 
species that is ever present in appreciable concen- 
trations in thermodynamic equilibrium and increases 
exponentially with rise in temperature, as shown in 
Figure 4.2. The vacancy is formed by removing an 
atom from its lattice site and depositing it in a nearby 
atomic site where it can be easily accommodated. 
Favoured places are the free surface of the crystal, a 
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Figure 4.2 Equilibrium concentration of vacancies as a 
function of temperature for aluminium (after Bradshaw and 
Pearson, 1957). 


grain boundary or the extra half-plane of an edge dislo- 
cation. Such sites are termed vacancy sources and the 
vacancy is created when sufficient energy is available 
(e.g. thermal activation) to remove the atom. If Ef is 
the energy required to form one such defect (usually 
expressed in electron volts per atom), the total energy 
increase resulting from the formation of n such defects 
is nEf. The accompanying entropy increase may be 
calculated using the relations S = kin W, where W 
is the number of ways of distributing n defects and 
N atoms on N + n lattice sites, i.e. (N + n)\/n\N\ 
Then the free energy, G, or strictly F of a crystal of 
n defects, relative to the free energy of the perfect 
crystal, is 

F = nEt-kT\n[(N + n)\/n\N\] (4.1) 

which by the use of Stirling’s theorem 1 simplifies to 

F = nE f - k T [N + w)ln (N + n) - n In n - IV In N] 

(4.2) 

The equilibrium value of n is that for which 
dF /dn = 0, which defines the state of minimum free 
energy as shown in Figure 4. 3. 2 Thus, differentiating 
equation (4.2) gives 

0 = Ef - kT [In (N + n)- In n] 

= E f -kT In [(N + n)/n] 
so that 

— — = exp [-Ef/kT] 

N + n 

Usually N is very large compared with n, so that 
the expression can be taken to give the atomic concen- 
tration, c, of lattice vacancies, n/N = exp [—Ef/kT]. 
A more rigorous calculation of the concentration of 
vacancies in thermal equilibrium in a perfect lat- 
tice shows that although c is principally governed by 

Stirling’s approximation states that In TV! = N\n N. 

2 dF /dn or dG/dn in known as the chemical potential. 



Figure 4.3 Variation of the energy of a crystal with addition 
ofn vacancies. 


the Boltzmann factor exp [—Ef/kT], the effect of the 
vacancy on the vibrational properties of the lattice 
also leads to an entropy term which is independent 
of temperature and usually written as exp [Sf/k]. The 
fractional concentration may thus be written 

c = n/N = exp [Sf/k] exp [— Ef/kT] 

= A exp [—Ef/kT] (4.3) 


The value of the entropy term is not accurately 
known but it is usually taken to be within a factor 
of ten of the value 10; for simplicity we will take it to 
be unity. 

The equilibrium number of vacancies rises rapidly 
with increasing temperature, owing to the exponential 
form of the expression, and for most common metals 
has a value of about 10 -4 near the melting point. For 
example, kT at room temperature (300 K) is ~l/40 eV 
and for aluminium Ef = 0.7 eV, so that at 900 K we 
have 


c = exp 


7 40 3001 

To x T x 900 


= exp [-9.3] = KT' 9 - 3 ' 2 - 31 « 1(T 4 


As the temperature is lowered, c should decrease 
in order to maintain equilibrium and to do this the 
vacancies must migrate to positions in the structure 
where they can be annihilated; these locations are then 
known as ‘vacancy sinks’ and include such places as 
the free surface, grain boundaries and dislocations. 

The defect migrates by moving through the energy 
maxima from one atomic site to the next with a 
frequency 


fS m \ 

v = Vo exp 1 — 1 exp 


KT 
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where v 0 is the frequency of vibration of the defect 
in the appropriate direction, S m is the entropy increase 
and E m is the internal energy increase associated with 
the process. The self-diffusion coefficient in a pure 
metal is associated with the energy to form a vacancy 
Ef and the energy to move it E m , being given by the 
expression 

Esd =E f + E m 

Clearly the free surface of a sample or the grain 
boundary interface are a considerable distance, in 
atomic terms, from the centre of a grain and so 
dislocations in the body of the grain or crystal are 
the most efficient ‘sink’ for vacancies. Vacancies are 
annihilated at the edge of the extra half-plane of atoms 
of the dislocation, as shown in Figure 4.4a and 4.4b. 
This causes the dislocation to climb, as discussed 
in Section 4.3.4. The process whereby vacancies 
are annihilated at vacancy sinks such as surfaces, 
grain boundaries and dislocations, to satisfy the 
thermodynamic equilibrium concentration at a given 
temperature is, of course, reversible. When a metal 
is heated the equilibrium concentration increases and, 
to produce this additional concentration, the surfaces, 
grain boundaries and dislocations in the crystal reverse 
their role and act as vacancy sources and emit 
vacancies; the extra half-plane of atoms climbs in the 
opposite sense (see Figures 4.4c and 4.4d). 

Below a certain temperature, the migration of vacan- 
cies will be too slow for equilibrium to be main- 
tained, and at the lower temperatures a concentration 
of vacancies in excess of the equilibrium number will 


(a) (b) 


(c) (d) 

Figure 4.4 Climb of a dislocation, (a) and (b) to annihilate, 
(c) and (d) to create a vacancy. 


be retained in the structure. Moreover, if the cool- 
ing rate of the metal or alloy is particularly rapid, 
as, for example, in quenching, the vast majority of 
the vacancies which exist at high temperatures can be 
‘frozen-in’. 

Vacancies are of considerable importance in gov- 
erning the kinetics of many physical processes. The 
industrial processes of annealing, homogenization, pre- 
cipitation, sintering, surface-hardening, as well as oxi- 
dation and creep, all involve, to varying degrees, the 
transport of atoms through the structure with the help 
of vacancies. Similarly, vacancies enable dislocations 
to climb, since to move the extra half-plane of a dislo- 
cation up or down requires the mass transport of atoms. 
This mechanism is extremely important in the recov- 
ery stage of annealing and also enables dislocations to 
climb over obstacles lying in their slip plane; in this 
way materials can soften and lose their resistance to 
creep at high temperatures. 

In metals the energy of formation of an interstitial 
atom is much higher than that for a vacancy and 
is of the order of 4 eV. At temperatures just below 
the melting point, the concentration of such point 
defects is only about 10“ 15 and therefore interstitials 
are of little consequence in the normal behaviour of 
metals. They are, however, more important in ceramics 
because of the more open crystal structure. They are 
also of importance in the deformation behaviour of 
solids when point defects are produced by the non- 
conservative motion of jogs in screw dislocation (see 
Section 4.3.4) and also of particular importance in 
materials that have been subjected to irradiation by 
high-energy particles. 

4.2.2 Point defects in non-metallic crystals 

Point defects in non-metallic, particularly ionic, struc- 
tures are associated with additional features (e.g. the 
requirement to maintain electrical neutrality and the 
possibility of both anion-defects and cation-defects 
existing). An anion vacancy in NaCl, for example, will 
be a positively-charged defect and may trap an elec- 
tron to become a neutral F-centre. Alternatively, an 
anion vacancy may be associated with either an anion 
interstitial or a cation vacancy. The vacancy-interstitial 
pair is called a Frenkel defect and the vacancy pair a 
Schottky defect, as shown in Figure 4.5. Interstitials 
are much more common in ionic structures than metal- 
lic structures because of the large ‘holes’ or interstices 
that are available. 

In general, the formation energy of each of these two 
types of defect is different and this leads to different 
defect concentrations. With regard to vacancies, when 
Ef > Ef, i.e. the formation will initially produce 
more cation than anion vacancies from dislocations 
and boundaries as the temperature is raised. However, 
the electrical field produced will eventually oppose 
the production of further cations and promote the 
formation of anions such that of equilibrium there 
will be almost equal numbers of both types and the 
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Figure 4.5 Representation of point defects in two-dimensional ionic structure: (a) perfect structure and monovalent ions, 
(b) two Schottky defects, (c) Frenkel defect, and (d) substitutional divalent cation impurity and cation vacancy. 


combined or total concentration c of Schottky defects 
at high temperatures is ~10 -4 . 

Foreign ions with a valency different from the host 
cation may also give rise to point defects to maintain 
charge neutrality. Monovalent sodium ions substituting 
for divalent magnesium ions in MgO, for example, 
must be associated with an appropriate number of 
either cation interstitials or anion vacancies in order 
to maintain charge neutrality. Deviations from the sto- 
ichiometric composition of the non-metallic material 
as a result of excess (or deficiency) in one (or other) 
atomic species also results in the formation of point 
defects. 

An example of excess-metal due to anion vacancies 
is found in the oxidation of silicon which takes place at 
the metal/oxide interface. Interstitials are more likely 
to occur in oxides with open crystal structures and 
when one atom is much smaller than the other as, for 
example, ZnO (Figure 4.6a). The oxidation of copper 
to Cu 2 0, shown in Figure 4.6b, is an example of non- 
stoichiometry involving cation vacancies. Thus copper 
vacancies are created at the oxide surface and diffuse 
through the oxide layer and are eliminated at the 
oxide/metal interface. 

Oxides which contain point defects behave as semi- 
conductors when the electrons associated with the 
point defects either form positive holes or enter the 
conduction band of the oxide. If the electrons remain 
locally associated with the point defects, then charge 


can only be transferred by the diffusion of the charge 
carrying defects through the oxide. Both p- and n-type 
semiconductors are formed when oxides deviate from 
stoichiometry: the former arises from a deficiency of 
cations and the latter from an excess of cations. 

Examples of /7-type semiconducting oxides are NiO, 
PbO and Cu 2 0 while the oxides of Zn, Cd and Be are 
n-type semiconductors. 

4.2.3 Irradiation of solids 

There are many different kinds of high-energy radi- 
ation (e.g. neutrons, electrons, a-particles, protons, 
deuterons, uranium fission fragments, y-rays, X-rays) 
and all of them are capable of producing some form 
of ‘radiation damage’ in the materials they irradiate. 
While all are of importance to some aspects of the 
solid state, of particular interest is the behaviour of 
materials under irradiation in a nuclear reactor. This is 
because the neutrons produced in a reactor by a fis- 
sion reaction have extremely high energies of about 
2 million electron volts (i.e. 2 MeV), and being elec- 
trically uncharged, and consequently unaffected by the 
electrical fields surrounding an atomic nucleus, can 
travel large distances through a structure. The resul- 
tant damage is therefore not localized, but is distributed 
throughout the solid in the form of ‘damage spikes.’ 

The fast neutrons (they are given this name because 
2 MeV corresponds to a velocity of 2 x 10 7 ms 1 ) 
are slowed down, in order to produce further fission, 



Figure 4.6 Schematic arrangement of ions in two typical oxides, (a) Zn > j O, with excess metal due to cation interstitials and 
(b) Cu^O, with excess non-metal due to cation vacancies. 
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by the moderator in the pile until they are in thermal 
equilibrium with their surroundings. The neutrons in 
a pile will, therefore, have a spectrum of energies 
which ranges from about 1/40 eV at room temperature 
(thermal neutrons) to 2 MeV (fast neutrons). However, 
when non-fissile material is placed in a reactor and 
irradiated most of the damage is caused by the 
fast neutrons colliding with the atomic nuclei of the 
material. 

The nucleus of an atom has a small diameter (e.g. 
10“ 10 m), and consequently the largest area, or cross- 
section, which it presents to the neutron for collision 
is also small. The unit of cross-section is a barn, i.e. 
10 -28 m 2 so that in a material with a cross-section 
of 1 bam, an average of 10 9 neutrons would have to 
pass through an atom (cross-sectional area 10“ 19 m 2 ) 
for one to hit the nucleus. Conversely, the mean free 
path between collisions is about 10 9 atom spacings or 
about 0.3 m. If a metal such as copper (cross-section, 
4 barns) were irradiated for 1 day (10 5 s) in a neutron 
flux of 10 17 m -2 s -1 the number of neutrons passing 
through unit area, i.e. the integrated flux, would be 
10 22 n nT 2 and the chance of a given atom being hit 
(^integrated flux x cross-section) would be 4 x 10 -6 , 
i.e. about 1 atom in 250000 would have its nucleus 
stmck. 

For most metals the collision between an atomic 
nucleus and a neutron (or other fast particle of mass 
m) is usually purely elastic, and the struck atom 
mass M will have equal probability of receiving any 
kinetic energy between zero and the maximum E max = 
4E n Mm/(M + m) 2 , where E n is the energy of the fast 
neutron. Thus, the most energetic neutrons can impart 
an energy of as much as 200000 eV, to a copper 
atom initially at rest. Such an atom, called a primary 
‘knock-on’, will do much further damage on its sub- 
sequent passage through the stmcture often producing 
secondary and tertiary knock-on atoms, so that severe 
local damage results. The neutron, of course, also con- 
tinues its passage through the structure producing fur- 
ther primary displacements until the energy transferred 
in collisions is less than the energy E d (~25 eV for 
copper) necessary to displace an atom from its lat- 
tice site. 

The damage produced in irradiation consists largely 
of interstitials, i.e. atoms knocked into interstitial posi- 
tions in the lattice, and vacancies, i.e. the holes they 
leave behind. The damaged region, estimated to con- 
tain about 60 000 atoms, is expected to be originally 
pear-shaped in form, having the vacancies at the cen- 
tre and the interstitials towards the outside. Such a 
displacement spike or cascade of displaced atoms is 
shown schematically in Figure 4.7. The number of 
vacancy-interstitial pairs produced by one primary 
knock-on is given by n ~ E max /4E d , and for copper 
is about 1000. Owing to the thermal motion of the 
atoms in the lattice, appreciable self-annealing of the 
damage will take place at all except the lowest tem- 
peratures, with most of the vacancies and interstitials 



Figure 4.7 Formation of vacancies and interstitials due to 
particle bombardment ( after Cottrell, 1959; courtesy of the 
Institute of Mechanical Engineers ). 

annihilating each other by recombination. However, it 
is expected that some of the interstitials will escape 
from the surface of the cascade leaving a correspond- 
ing number of vacancies in the centre. If this number 
is assumed to be 100, the local concentration will be 
100/60000 or ^2 x 1(T 3 . 

Another manifestation of radiation damage concerns 
the dispersal of the energy of the stopped atom into 
the vibrational energy of the lattice. The energy is 
deposited in a small region, and for a very short 
time the metal may be regarded as locally heated. To 
distinguish this damage from the ‘displacement spike’, 
where the energy is sufficient to displace atoms, this 
heat-affected zone has been called a ‘thermal spike’. To 
raise the temperature by 1000°C requires about 3 R x 
4.2 kJ/mol or about 0.25 eV per atom. Consequently, 
a 25 eV thermal spike could heat about 100 atoms 
of copper to the melting point, which corresponds 
to a spherical region of radius about 0.75 nm. It is 
very doubtful if melting actually takes place, because 
the duration of the heat pulse is only about 10“ 11 to 
10- 12 s. However, it is not clear to what extent the 
heat produced gives rise to an annealing of the primary 
damage, or causes additional quenching damage (e.g. 
retention of high-temperature phases). 

Slow neutrons give rise to transmutation products. 
Of particular importance is the production of the noble 
gas elements, e.g. krypton and xenon produced by fis- 
sion in U and Pu, and helium in the light elements 
B, Li, Be and Mg. These transmuted atoms can cause 
severe radiation damage in two ways. First, the inert 
gas atoms are almost insoluble and hence in association 
with vacancies collect into gas bubbles which swell 
and crack the material. Second, these atoms are often 
created with very high energies (e.g. as o'-particles 
or fission fragments) and act as primary sources of 
knock-on damage. The fission of uranium into two 
new elements is the extreme example when the fis- 
sion fragments are thrown apart with kinetic energy 
~100 MeV. However, because the fragments carry 
a large charge their range is short and the damage 
restricted to the fissile material itself, or in materi- 
als which are in close proximity. Heavy ions can be 
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accelerated to kilovolt energies in accelerators to pro- 
duce heavy ion bombardment of materials being tested 
for reactor application. These moving particles have a 
short range and the damage is localized. 

4.2.4 Point defect concentration and 
annealing 

Electrical resistivity p is one of the simplest and 
most sensitive properties to investigate the point defect 
concentration. Point defects are potent scatterers of 
electrons and the increase in resistivity following 
quenching (A p) may be described by the equation 

Ap = A exp [— £V/kr Q ] (4.4) 

where A is a constant involving the entropy of 
formation, Ef is the formation energy of a vacancy 
and Tq the quenching temperature. Measuring the 
resistivity after quenching from different temperatures 
enables Ef to be estimated from a plot of A p 0 
versus 1/Tq. The activation energy, E m , for the 
movement of vacancies can be obtained by measuring 
the rate of annealing of the vacancies at different 
annealing temperatures. The rate of annealing is 
inversely proportional to the time to reach a certain 
value of ‘annealed-out’ resistivity. Thus, \/t\ =A 
exp [— E m /kT\] and l/t 2 = exp [— E m /kT 2 \ and by 
eliminating A we obtain In (t 2 /tf) = E m [(l/T 2 ) — 
(1/74 )]/k where E m is the only unknown in the 
expression. Values of Ef and E m for different materials 
are given in Table 4.1. 

At elevated temperatures the very high equilibrium 
concentration of vacancies which exists in the structure 
gives rise to the possible formation of divacancy and 
even tri-vacancy complexes, depending on the value 
of the appropriate binding energy. For equilibrium 
between single and di-vacancies, the total vacancy 
concentration is given by 

C v — C \ Y ~b 2 c*2v 

and the di- vacancy concentration by 

c 2v = Azciv 2 exp [fii/kTl 

where A is a constant involving the entropy of forma- 
tion of di-vacancies, B 2 the binding energy for vacancy 
pairs estimated to be in the range 0.1 -0.3 eV and z a 
configurational factor. The migration of di-vacancies is 


Table 4.1 Values of vacancy formation (Ef) and migration 
( E m ) energies for some metallic materials together with 
the self-diffusion energy (Esd) 


Energy 

(eV) 

Cu 

Al 

Ni 

Mg 

Fe 

W 

NiAl 

Ef 

1 . 0 - 1. 1 

0.76 

1.4 

0.9 

2.13 

3.3 

1.05 

Em 

1 . 0 - 1. 1 

0.62 

1.5 

0.5 

0.76 

1.9 

2.4 

E d 

2. 0-2. 2 

1.38 

2.9 

1.4 

2.89 

5.2 

3.45 


an easier process and the activation energy for migra- 
tion is somewhat lower than E m for single vacancies. 

Excess point defects are removed from a mate- 
rial when the vacancies and/or interstitials migrate to 
regions of discontinuity in the structure (e.g. free sur- 
faces, grain boundaries or dislocations) and are annihi- 
lated. These sites are termed defect sinks. The average 
number of atomic jumps made before annihilation is 
given by 

n = Azvt exp [—E m /kT a \ (4.5) 

where A is a constant (~1) involving the entropy of 
migration, z the coordination around a vacancy, v the 
Debye frequency (~10 13 /s), t the annealing time at the 
ageing temperature T a and E m the migration energy of 
the defect. For a metal such as aluminium, quenched 
to give a high concentration of retained vacancies, the 
annealing process takes place in two stages, as shown 
in Figure 4.8; stage I near room temperature with an 
activation energy ~0.58 eV and n ~ 10 4 , and stage II 
in the range 140-200°C with an activation energy of 
-1.3 eV. 

Assuming a random walk process, single vacancies 
would migrate an average distance (*/n x atomic spac- 
ing b) ~30 nm. This distance is very much less than 
either the distance to the grain boundary or the spac- 
ing of the dislocations in the annealed metal. In this 
case, the very high supersaturation of vacancies pro- 
duces a chemical stress, somewhat analogous to an 
osmotic pressure, which is sufficiently large to create 
new dislocations in the structure which provide many 
new ‘sinks’ to reduce this stress rapidly. 

The magnitude of this chemical stress may be 
estimated from the chemical potential, if we let d F 
represent the change of free energy when d n vacancies 
are added to the system. Then, 

dF/dn = E f + kT In (n/N) = -k T In c 0 + kT In c 
= kT In (c/co) 

where c is the actual concentration and c 0 the equilib- 
rium concentration of vacancies. This may be rewrit- 
ten as 



Temperature (°C) 


Figure 4.8 Variation of quenched- in resistivity with 
temperature of annealing for aluminium (after Panseri and 
Federighi, 1958, 1223). 
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dF/dV = Energy/volume = stress 

= (kr/b 3 )[ln (c/c 0 )] (4.6) 


where dV is the volume associated with d n vacancies 
and b 3 is the volume of one vacancy. Inserting typical 
values, K T — 1 /40 eV at room temperature, b = 
0.25 nm, shows K T/b 3 ~ 150 MN/m 2 . Thus, even a 
moderate 1% supersaturation of vacancies i.e. when 
(c/cq) = 1.01 and In (c/c 0 ) = 0.01, introduces a 
chemical stress a c equivalent to 1.5 MN/m 2 . 

The equilibrium concentration of vacancies at a 
temperature T 2 will be given by c 2 = exp[— Ef/kT 2 ] 
and at T\ by c\ = exp [—Ef/kT\\. Then, since 


ln(c 2 /ci) = (Ef/k) 


1 

T~i 


1 

T~2 


the chemical stress produced by quenching a metal 
from a high temperature T 2 to a low temperature T\ 
is 


a c = (kT/b 3 )ln(c 2 / Cl ) = (E t /b 3 ) 



For aluminium, Ef is about 0.7 eV so that quench- 
ing from 900 K to 300 K produces a chemical stress 
of about 3 GN/m 2 . This stress is extremely high, sev- 
eral times the theoretical yield stress, and must be 
relieved in some way. Migration of vacancies to grain 
boundaries and dislocations will occur, of course, but 
it is not surprising that the point defects form addi- 
tional vacancy sinks by the spontaneous nucleation of 
dislocations and other stable lattice defects, such as 
voids and stacking fault tetrahedra (see Sections 4.5.3 
and 4.6). 

When the material contains both vacancies and 
interstitials the removal of the excess point defect 
concentration is more complex. Figure 4.9 shows 
the ‘annealing’ curve for irradiated copper. The 
resistivity decreases sharply around 20 K when the 
interstitials start to migrate, with an activation 
energy E m ~ 0.1 eV. In Stage I, therefore, most 
of the Frenkel (interstitial -vacancy) pairs anneal 
out. Stage II has been attributed to the release of 
interstitials from impurity traps as thermal energy 
supplies the necessary activation energy. Stage III is 



Figure 4.9 Variation of resistivity with temperature 
produced by neutron irradiation for copper ( after Diehl). 


around room temperature and is probably caused by 
the annihilation of free interstitials with individual 
vacancies not associated with a Frenkel pair, and also 
the migration of di- vacancies. Stage IV corresponds to 
the stage I annealing of quenched metals arising from 
vacancy migration and annihilation to form dislocation 
loops, voids and other defects. Stage V corresponds 
to the removal of this secondary defect population by 
self-diffusion. 


4.3 Line defects 

4.3.1 Concept of a dislocation 

All crystalline materials usually contain lines of struc- 
tural discontinuities running throughout each crystal 
or grain. These line discontinuities are termed dislo- 
cations and there is usually about 10 10 to 10 12 m of 
dislocation line in a metre cube of material. 1 Disloca- 
tions enable materials to deform without destroying the 
basic crystal structure at stresses below that at which 
the material would break or fracture if they were not 
present. 

A crystal changes its shape during deformation by 
the slipping of atomic layers over one another. The 
theoretical shear strength of perfect crystals was first 
calculated by Frenkel for the simple rectangular-type 
lattice shown in Figure 4.10 with spacing a between 

! This is usually expressed as the density of dislocations 
p = IQ 10 to 10 12 m- 2 . 



Figure 4.10 Slip of crystal planes (a); shear stress versus displacement curve (b). 
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the planes. The shearing force required to move a plane 
of atoms over the plane below will be periodic, since 
for displacements x < b/ 2, where b is the spacing of 
atoms in the shear direction, the lattice resists the 
applied stress but for x > b/2 the lattice forces assist 
the applied stress. The simplest function with these 
properties is a sinusoidal relation of the form 

r = r m sin(27Tx/b) — T m 2nx/b 

where r m is the maximum shear stress at a 
displacement = b/ 4. For small displacements the 
elastic shear strain given by x/ a is equal to r / /x from 
Hooke’s law, where /x is the shear modulus, so that 

r m = (7z/27T)b/a (4.7) 

and since b ~ a, the theoretical strength of a perfect 
crystal is of the order of /x/10. 

This calculation shows that crystals should be rather 
strong and difficult to deform, but a striking exper- 
imental property of single crystals is their softness, 
which indicates that the critical shear stress to pro- 
duce slip is very small (about 10 -5 p or ~50gf mm -2 ). 
This discrepancy between the theoretical and experi- 
mental strength of crystals is accounted for if atomic 
planes do not slip over each other as rigid bodies but 
instead slip starts at a localized region in the structure 
and then spreads gradually over the remainder of the 
plane, somewhat like the disturbance when a pebble is 
dropped into a pond. 

In general, therefore, the slip plane may be divided 
into two regions, one where slip has occurred and the 
other which remains unslipped. Between the slipped 
and unslipped regions the structure will be dislocated 
(Figure 4.11); this boundary is referred to as a dislo- 
cation line, or dislocation. Three simple properties of 
a dislocation are immediately apparent, namely: (1) it 
is a line discontinuity, (2) it forms a closed loop in the 
interior of the crystal or emerges at the surface and 
(3) the difference in the amount of slip across the dis- 
location line is constant. The last property is probably 
the most important, since a dislocation is characterized 
by the magnitude and direction of the slip movement 
associated with it. This is called the Burgers vector, 
b, which for any given dislocation line is the same all 
along its length. 

4.3.2 Edge and screw dislocations 

It is evident from Figure 4.11a that some sections 
of the dislocation line are perpendicular to b , others 
are parallel to b while the remainder lie at an angle 
to b. This variation in the orientation of the line 
with respect to the Burgers vector gives rise to a 
difference in the structure of the dislocation. When 
the dislocation line is normal to the slip direction it is 
called an edge dislocation. In contrast, when the line 
of the dislocations is parallel to the slip direction the 
dislocation line is known as a screw dislocation. From 
the diagram shown in Figure 4.11a it is evident that 





Figure 4.11 Schematic representation of (a) a dislocation 
loop, (b) edge dislocation and (c) screw dislocation. 

the dislocation line is rarely pure edge or pure screw, 
but it is convenient to think of these ideal dislocations 
since any dislocation can be resolved into edge and 
screw components. The atomic structure of a simple 
edge and screw dislocation is shown in Figure 4.13 
and 4.14. 

4.3.3 The Burgers vector 

It is evident from the previous sections that the Burgers 
vector b is an important dislocation parameter. In any 
deformation situation the Burgers vector is defined by 
constructing a Burgers circuit in the dislocated crystal 
as shown in Figure 4.12. A sequence of lattice vectors 
is taken to form a closed clockwise circuit around 
the dislocation. The same sequence of vectors is then 
taken in the perfect lattice when it is found that the 
circuit fails to close. The closure vector FS (finish- 
start) defines b for the dislocation. With this FS/RH 
(right-hand) convention it is necessary to choose one 
direction along the dislocation line as positive. If this 
direction is reversed the vector b is also reversed. 
The Burgers vector defines the atomic displacement 
produced as the dislocation moves across the slip 
plane. Its value is governed by the crystal structure 
because during slip it is necessary to retain an identical 
lattice structure both before and after the passage of 
the dislocation. This is assured if the dislocation has 



Figure 4.12 Burgers circuit round a dislocation A fails to 
close when repeated in a perfect lattice unless completed by 
a closure vector FS equal to the Burgers vector b. 
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Figure 4.13 Slip caused by the movement of an edge dislocation. 





Figure 4.14 Slip caused by the movement of a screw dislocation. 


a Burgers vector equal to one lattice vector and, since 
the energy of a dislocation depends on the square of 
the Burgers vector (see Section 4.3. 5. 2), its Burgers 
vector is usually the shortest available lattice vector. 
This vector, by definition, is parallel to the direction 
of closest packing in the structure, which agrees with 
experimental observation of the slip direction. 

The Burgers vector is conveniently specified by 
its directional co-ordinates along the principal crys- 
tal axes. In the fee lattice, the shortest lattice vector 
is associated with slip from a cube comer to a face 
centre, and has components a/2, a/2, 0. This is usu- 
ally written a/2[l 10], where a is the lattice parameter 
and [110] is the slip direction. The magnitude of the 
vector, or the strength of the dislocation, is then given 
by y/{a 2 (l 2 + l 2 + 0 2 )/4} = a/V‘. 2 . The correspond- 
ing slip vectors for the bcc and eph structures are 
b = a/2[l 1 1] and b = a/3[l 120] respectively. 

4.3.4 Mechanisms of slip and climb 

The atomic structure of an edge dislocation is shown 
in Figure 4.13a. Here the extra half-plane of atoms is 
above the slip plane of the crystal, and consequently 
the dislocation is called a positive edge dislocation and 
is often denoted by the symbol 3L. When the half-plane 
is below the slip plane it is termed a negative disloca- 
tion. If the resolved shear stress on the slip plane is r 
and the Burgers vector of the dislocation b, the force 
on the dislocation, i.e. force per unit length of dislo- 
cation, is F = zb. This can be seen by reference to 


Figure 4.13 if the crystal is of side L. The force on the 
top face (stress x area) is r x L 2 . Thus, when the two 
halves of the crystal have slipped the relative amount b, 
the work done by the applied stress (force x distance) 
is zL 2 b. On the other hand, the work done in mov- 
ing the dislocation (total force on dislocation FL x 
distance moved) is FL 2 , so that equating the work 
done gives F (force per unit length of dislocation) = 
zb. Figure 4.13 indicates how slip is propagated by the 
movement of a dislocation under the action of such a 
force. The extra half-plane moves to the right until 
it produces the slip step shown at the surface of the 
crystal; the same shear will be produced by a negative 
dislocation moving from the right to left. 1 

The slip process as a result of a screw dislocation is 
shown in Figure 4.14. It must be recognized, however, 
that the dislocation is more usually a closed loop and 
slip occurs by the movement of all parts of the dislo- 
cation loop, i.e. edge, screw and mixed components, 
as shown in Figure 4.15. 

A dislocation is able to glide in that slip plane 
which contains both the line of the dislocation and 
its Burgers vector. The edge dislocation is confined 
to glide in one plane only. An important difference 
between the motion of a screw dislocation and that of 

! An obvious analogy to the slip process is the movement of 
a caterpillar in the garden or the propagation of a ruck in a 
carpet to move the carpet into place. In both examples, the 
effort to move is much reduced by this propagation process. 
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Figure 4.15 Process of slip by the expansion of a 
dislocation loop in the slip plane. 


an edge dislocation arises from the fact that the screw 
dislocation is cylindrically symmetrical about its axis 
with its b parallel to this axis. To a screw dislocation all 
crystal planes passing through the axis look the same 
and, therefore, the motion of the screw dislocation is 
not restricted to a single slip plane, as is the case 
for a gliding edge dislocation. The process thereby a 
screw dislocation glides into another slip plane having 
a slip direction in common with the original slip plane, 
as shown in Figure 4.16, is called cross-slip. Usually, 
the cross-slip plane is also a close-packed plane, e.g. 
{1 1 1} in fee crystals. 

The mechanism of slip illustrated above shows that 
the slip or glide motion of an edge dislocation is 
restricted, since it can only glide in that slip plane 



Figure 4.16 Cross-slip of a screw dislocation in a crystal. 


which contains both the dislocation line and its Burg- 
ers vector. However, movement of the dislocation line 
in a direction normal to the slip plane can occur under 
certain circumstances; this is called dislocation climb. 
To move the extra half-plane either up or down, as is 
required for climb, requires mass transport by diffu- 
sion and is a non-conservative motion. For example, 
if vacancies diffuse to the dislocation line it climbs up 
and the extra half-plane will shorten. However, since 
the vacancies will not necessarily arrive at the disloca- 
tion at the same instant, or uniformly, the dislocation 
climbs one atom at a time and some sections will lie in 
one plane and other sections in parallel neighbouring 
planes. Where the dislocation deviates from one plane 
to another it is known as a jog, and from the diagrams 
of Figure 4.17 it is evident that a jog in a dislocation 
may be regarded as a short length of dislocation not 
lying in the same slip plane as the main dislocation 
but having the same Burgers vector. 

Jogs may also form when a moving dislocation cuts 
through intersecting dislocations, i.e. forest 1 disloca- 
tions, during its glide motion. In the lower range of 
temperature this will be the major source of jogs. Two 
examples of jogs formed from the crossings of dislo- 
cations are shown in Figure 4.18. Figure 4.18a shows 
a crystal containing a screw dislocation running from 
top to bottom which has the effect of ‘ramping’ all 
the planes in the crystal. If an edge dislocation moves 
through the crystal on a horizontal plane then the screw 
dislocation becomes jogged as the top half of the crys- 
tal is sheared relative to the bottom. In addition, the 
screw dislocation becomes jogged since one part has to 
take the upper ramp and the other part the lower ramp. 
The result is shown schematically in Figure 4.18b. 
Figure 4.18c shows the situation for a moving screw 
cutting through the vertical screw; the jog formed in 
each dislocation is edge in character since it is per- 
pendicular to its Burgers vector which lies along the 
screw axis. 

A jog in an edge dislocation will not impede the 
motion of the dislocation in its slip plane because it 
can, in general, move with the main dislocation line 
by glide, not in the same slip plane (see Figure 4.17b) 



Figure 4.17 Climb of an edge dislocation in a crystal. 

l A number of dislocation lines may project from the slip 
plane like a forest, hence the term ‘forest dislocation’. 
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Figure 4.18 Dislocation intersections, (a) and 
(b) screw -edge, (c) screw-screw. 


but in an intersecting slip plane that does contain the 
line of the jog and the Burgers vector. In the case 
of a jog in a screw dislocation the situation is not so 
clear, since there are two ways in which the jog can 
move. Since the jog is merely a small piece of edge 
dislocation it may move sideways, i.e. conservatively, 
along the screw dislocation and attach itself to an edge 
component of the dislocation line. Conversely, the jog 
may be dragged along with the screw dislocation. This 
latter process requires the jog to climb and, because 
it is a non-conservative process, must give rise to 
the creation of a row of point defects, i.e. either 
vacancies or interstitials depending on which way the 
jog is forced to climb. Clearly, such a movement is 
difficult but, nevertheless, may be necessary to give the 
dislocation sufficient mobility. The ‘frictional’ drag of 


Figure 4.19 (a) Formation of a multiple jog by cross-slip, 
and (b) motion of jog to produce a dipole. 

jogs will make a contribution to the work-hardening 1 
of the material. 

Apart from elementary jogs, or those having a height 
equal to one atomic plane spacing, it is possible to 
have multiple jogs where the jog height is several 
atomic plane spacings. Such jogs can be produced, 
for example, by part of a screw dislocation cross- 
slipping from the primary plane to the cross-slip plane, 
as shown in Figure 4.19a. In this case, as the screw 
dislocation glides forward it trails the multiple jog 
behind, since it acts as a frictional drag. As a result, 
two parallel dislocations of opposite sign are cre- 
ated in the wake of the moving screw, as shown in 
Figure 4.19b; this arrangement is called a dislocation 
dipole. Dipoles formed as debris behind moving dis- 
locations are frequently seen in electron micrographs 
taken from deformed crystals (see Chapter 7). As the 
dipole gets longer the screw dislocation will eventu- 
ally jettison the debris by cross-slipping and pinching 
off the dipole to form a prismatic loop, as shown in 
Figure 4.20. The loop is capable of gliding on the sur- 
face of a prism, the cross-sectional area of which is 
that of the loop. 



Figure 4.20 Formation of prismatic dislocation loop from 
screw dislocation trailing a dipole. 

1 When material is deformed by straining or working the 
flow stress increases with increase in strain (i.e. it is harder 
to deform a material which has been strained already). This 
is called strain- or work-hardening. 
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4.3.5 Strain energy associated with 
dislocations 

4.3.5. 1 Stress fields of screw and edge 
dislocations 

The distortion around a dislocation line is evident from 
Figure 4.1 and 4.13. At the centre of the dislocation the 
strains are too large to be treated by elasticity theory, 
but beyond a distance r 0 , equal to a few atom spacing s 
Hooke’s law can be applied. It is therefore necessary 
to define a core to the dislocation at a cut-off radius r 0 
( ~b ) inside which elasticity theory is no longer appli- 
cable. A screw dislocation can then be considered as a 
cylindrical shell of length / and radius r contained in an 
elastically isotropic medium (Figure 4.21). A discon- 
tinuity in displacement exists only in the z-direction, 
i.e. parallel to the dislocation, such that u = v = 0, 
w — b. The elastic strain thus has to accommodate 
a displacement w = b around a length 2nr. In an 
elastically isotropic crystal the accommodation must 
occur equally all round the shell and indicates the 
simple relation w = b0/2jr in polar (r, 0, z) coor- 
dinates. The corresponding shear strain yo z (= y z e) = 
b I 2tzy and shear stress r ez {= x z6 ) = ub/2nr which acts 
on the end faces of the cylinder with o rr and x v q equal to 
zero. 1 Alternatively, the stresses are given in cartesian 
coordinates (x, y, z) 


are the normal stresses a xx , a yy along the x- and y-axes 
respectively, and the shear stress x xy which acts in the 
direction of the y-axis on planes perpendicular to the x- 
axis. The third normal stress cr zz = v{o xx + a yy ) where 
v is Poisson’s ratio, and the other shear stresses x yz 
and x 7X are zero. In polar coordinates r, 6 and z, the 
stresses are <r rr , o ee , and x r6 . 

Even in the case of the edge dislocation the dis- 
placement b has to be accommodated round a ring of 
length 2jrr, so that the strains and the stresses must 
contain a term in b/2nr. Moreover, because the atoms 
in the region 0 < 0 < n are under compression and for 
7t < 0 < 2tt in tension, the strain field must be of the 
form (b/2nr)f(Q), where f(0) is a function such as 
sin 0 which changes sign when 0 changes from 0 to 
2tt. It can be shown that the stresses are given by 

c 7 rr = Oqq = —D sin 0/r; o r6 = D cos 0/r; 
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(4.9) 


where D = iib/2jr(l — v). These equations show that 
the stresses around dislocations fall off as 1/r and 
hence the stress field is long-range in nature. 


t«(= r„) = -fiby/ 2n(x 2 + y 2 ) 

T y - ( = T-y ) = — nbx /2jt(x 2 + y 2 ) (4.8) 

with all other stresses equal to zero. The field of a 
screw dislocation is therefore purely one of shear, 
having radial symmetry with no dependence on 6. This 
mathematical description is related to the structure of 
a screw which has no extra half-plane of atoms and 
cannot be identified with a particular slip plane. 

An edge dislocation has a more complicated stress 
and strain field than a screw. The distortion associated 
with the edge dislocation is one of plane strain, since 
there are no displacements along the z-axis, i.e. w = 0. 
In plane deformation the only stresses to be determined 



^his subscript notation z0 indicates that the stress is in the 
^-direction on an element perpendicular to the z-direction. 
The stress with subscript rr is thus a normal stress and 
denoted by o rr and the subscript rO a shear stress and 
denoted by t r g. 


4.3. 5.2 Strain energy of a dislocation 

A dislocation is a line defect extending over large 
distances in the crystal and, since it has a strain energy 
per unit length (J m _1 ), it possesses a total strain 
energy. An estimate of the elastic strain energy of 
screw dislocation can be obtained by taking the strain 
energy (i.e. \ x stress x strain per unit volume) in an 
annular ring around the dislocation of radius r and 
thickness dr to be lx (/xZ?/27ir) x (b/2nr) x 27irdr. 
The total strain energy per unit length of dislocation is 
then obtained by integrating from r 0 the core radius, 
to r the outer radius of the strain field, and is 
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(4.10) 


With an edge dislocation this energy is modified by 
the term (1 — v) and hence is about 50% greater than 
a screw. For a unit dislocation in a typical crystal 
r 0 — 0.25 nm, r ~ 2.5 pm and In [r/r 0 ] — 9.2, so that 
the energy is approximately fib 2 per unit length of 
dislocation, which for copper [taking /x = 40 GNm -2 , 
b = 0.25 nm and 1 eV = 1.6 x 10 -19 J] is about 4 eV 
for every atom plane threaded by the dislocation. 2 If 
the reader prefers to think in terms of one metre of dis- 
location line, then this length is associated with about 
2 x 10 10 electron volts. We shall see later that heavily- 
deformed metals contain approximately 10 16 m/m 3 of 


2 The energy of the core must be added to this estimate. The 

core energy is about pi? 2 / 10 or f eV per atom length. 
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Figure 4.22 Interaction between dislocations not on the same slip plane: (a) unlike dislocation, (b) like dislocations. The 
arrangement in (b) constitutes a small-angle boundary. 


dislocation line which leads to a large amount of 
energy stored in the lattice (i.e. ~4J/g for Cu). Clearly, 
because of this high line energy a dislocation line will 
always tend to shorten its length as much as possi- 
ble, and from this point of view it may be considered 
to possess a line tension, T & a fib 2 , analogous to the 
surface energy of a soap film, where a ~ 

4. 3. 5. 3 Interaction of dislocations 
The strain field around a dislocation, because of its 
long-range nature, is also important in influencing the 
behaviour of other dislocations in the crystal. Thus, it 
is not difficult to imagine that a positive dislocation 
will attract a negative dislocation lying on the same 
slip plane in order that their respective strain fields 
should cancel. Moreover, as a general rule it can be 
said that the dislocations in a crystal will interact with 
each other to take up positions of minimum energy to 
reduce the total strain energy of the lattice. 

Two dislocations of the same sign will repel each 
other, because the strain energy of two dislocations 
on moving apart would be 2 x b 2 whereas if they 
combined to form one dislocation of Burgers vec- 
tor 2b, the strain energy would then be (2b) 2 = 4b 2 ; 
a force of repulsion exists between them. The force 
is, by definition, equal to the change of energy with 
position (dE/dr) and for screw dislocations is simply 
F = /ib 2 /2nr where r is the distance between the two 
dislocations. Since the stress field around screw dislo- 
cations has cylindrical symmetry the force of inter- 
action depends only on the distance apart, and the 
above expression for F applies equally well to par- 
allel screw dislocations on neighbouring slip planes. 
For parallel edge dislocations the force -distance rela- 
tionship is less simple. When the two edge dislocations 
lie in the same slip plane the relation is similar to that 
for two screws and has the form F = fib 2 / (l — v)2nr, 
but for edge dislocations with the same Burgers vector 
but not on the same slip plane the force also depends 


on the angle 0 between the Burgers vector and the line 
joining the two dislocations (Figure 4.22a). 

Edge dislocations of the same sign repel and oppo- 
site sign attract along the line between them, but the 
component of force in the direction of slip, which gov- 
erns the motion of a dislocation, varies with the angle 
6. With unlike dislocations an attractive force is expe- 
rienced for 0 > 45° but a repulsive force for 0 < 45°, 
and in equilibrium the dislocations remain at an angle 
of 45° to each other. For like dislocations the converse 
applies and the position 6 = 45° is now one of unstable 
equilibrium. Thus, edge dislocations which have the 
same Burgers vector but which do not lie on the same 
slip plane will be in equilibrium when 0 = 90°, and 
consequently they will arrange themselves in a plane 
normal to the slip plane, one above the other a distance 
h apart. Such a wall of dislocations constitutes a small- 
angle grain boundary as shown in Figure 4.22b, where 
the angle across the boundary is given by 0 = b/h. 
This type of dislocation array is also called a sub- 
grain or low- angle boundary, and is important in the 
annealing of deformed metals. 

By this arrangement the long-range stresses from 
the individual dislocations are cancelled out beyond a 
distance of the order of h from the boundary. It then 
follows that the energy of the crystal boundary will 
be given approximately by the sum of the individual 
energies, each equal to {/ib 2 /47z(\ — v)}ln (h/r 0 ) per 
unit length. There are 1 /h or 0/b dislocations in a unit 
length, vertically, and hence, in terms of the misorien- 
tation across the boundary 0 = b/h, the energy y gb per 
unit area of boundary is 
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where £0 = lib /4 tt( 1 — v) and A = In (/?/r 0 ); this is 
known as the Read- Shockley formula. Values from it 
give good agreement with experimental estimates even 
up to relatively large angles. For 6 ~ 25°, y gb ~ /ib/25 
or ~ 0.4 J/m 2 , which surprisingly is close to the value 
for the energy per unit area of a general large- angle 
grain boundary. 

4.3.6 Dislocations in ionic structures 

The slip system which operates in materials with NaCl 
structure is predominantly a/ 2(1 1 0) {1 10}. The clos- 
est packed plane {1 0 0} is not usually the preferred slip 
plane because of the strong electrostatic interaction that 
would occur across the slip plane during slip; like ions 
are brought into neighbouring positions across the slip 
plane for (1 00) but not for the (1 1 0). Dislocations in 
these materials are therefore simpler than fee metals, 
but they may carry an electric charge (the edge disloca- 
tion on {1 1 0}, for example). Figure 4.23a has an extra 
‘half-plane’ made up of a sheet of Na + ions and one of 
Cl - ions. The line as a whole can be charged up to a 
maximum of e/2 per atom length by acting as a source 
or sink for point defects. Figure 4.23b shows different 
jogs in the line which may either carry a charge or be 
uncharged. The jogs at B and C would be of charge 
+e / 2 because the section BC has a net charge equal 
to e. The jog at D is uncharged. 



Figure 4.23 Edge dislocation in NaCl, showing: (a) two 
extra half- sheets of ions: anions are open circles, cations are 
shaded; (b) charged and uncharged jogs (after Kelly and 
Groves, 1970). 


4.4 Planar defects 

4.4.1 Grain boundaries 

The small-angle boundary described in Section 4. 3. 5. 3 
is a particular example of a planar defect interface in 
a crystal. Many such planar defects occur in materials 
ranging from the large-angle grain boundary, which is 
an incoherent interface with a relatively high energy of 
~0.5 J/m 2 , to atomic planes in the crystal across which 
there is a mis-stacking of the atoms, i.e. twin interfaces 
and stacking faults which retain the coherency of the 
packing and have much lower energies (=€0.1 J/m 2 ). 
Generally, all these planar defects are associated with 
dislocations in an extended form. 

A small-angle tilt boundary can be described ade- 
quately by a vertical wall of dislocations. Rotation of 
one crystal relative to another, i.e. a twist boundary, 
can be produced by a crossed grid of two sets of screw 
dislocations as shown in Figure 4.24. These bound- 
aries are of a particularly simple kind separating two 
crystals which have a small difference in orientation, 
whereas a general grain boundary usually separates 
crystals which differ in orientation by large angles. In 
this case, the boundary has five degrees of freedom, 
three of which arise from the fact that the adjoin- 
ing crystals may be rotated with respect to each other 
about the three perpendicular axes, and the other two 
from the degree of freedom of the orientation of the 
boundary surface itself with respect to the crystals. 
Such a large-angle (30-40°) grain boundary may sim- 
ply be regarded as a narrow region, about two atoms 
thick, across which the atoms change from the lattice 
orientation of the one grain to that of the other. Nev- 
ertheless, such a grain boundary may be described by 
an arrangement of dislocations, but their arrangement 
will be complex and the individual dislocations are not 
easily recognized or analysed. 

The simplest extension of the dislocation model for 
low-angle boundaries to high-angle grain boundaries 
is to consider that there are islands of good atomic 
fit surrounded by non-coherent regions. In a low-angle 
boundary the ‘good fit’ is perfect crystal and the ‘bad 



Figure 4.24 Representation of a twist boundary produced 
by cross-grid of screw dislocations. 


98 Modern Physical Metallurgy and Materials Engineering 


fit’ is accommodated by lattice dislocations, whereas 
for high-angle boundaries the ‘good fit’ could be an 
interfacial structure with low energy and the bad fit 
accommodated by dislocations which are not neces- 
sarily lattice dislocations. These dislocations are often 
termed intrinsic secondary grain boundary dislocations 
(gbds) and are essential to maintain the boundary at 
that mis-orientation. 

The regions of good fit are sometimes described 
by the coincident site lattice (CSL) model, with its 
development to include the displacement shift complex 
(DSC) lattice. A CSL is a three-dimensional super- 
lattice on which a fraction 1 / Y of the lattice points 
in both crystal lattices lie; for the simple structures 
there will be many such CSLs, each existing at a 
particular misorientation. One CSL is illustrated in 
Figure 4.25 but it must be remembered that the CSL 
is three-dimensional, infinite and interpenetrates both 
crystals; it does not in itself define an interface. How- 
ever, an interface is likely to have a low energy if 
it lies between two crystals oriented such that they 
share a high proportion of lattice sites, i.e. preferred 
misorientations will be those with CSLs having low 
Y values. Such misorientations can be predicted from 
the expression 

6 = 2tarr' f-W) 


where b and a are integers and N = h 2 + k 2 + / 2 ; the 
Y value is then given by a 2 + Nb 2 , divided by 2 until 
an odd number is obtained. 

The CSL model can only be used to describe cer- 
tain specific boundary misorientations but it can be 
extended to other misorientations by allowing the pres- 
ence of arrays of dislocations which act to preserve a 
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Figure 4.25 Two-dimensional section of a CSL with 
Y$ 36 .9° [100] twist orientation (courtesy of 
P. Goodhew). 


special orientation between them. Such intrinsic sec- 
ondary dislocations must conserve the boundary struc- 
ture and, generally, will have Burgers vectors smaller 
than those of the lattice dislocations. 

When a polycrystalline specimen is examined in 
TEM other structural features apart from intrinsic grain 
boundary dislocations (gbds) may be observed in a 
grain boundary, such as ‘extrinsic’ dislocations which 
have probably run-in from a neighbouring grain, and 
interface ledges or steps which curve the boundary. At 
low temperatures the run-in lattice dislocation tends 
to retain its character while trapped in the interface, 
whereas at high temperatures it may dissociate into 
several intrinsic gbds resulting in a small change in 
misorientation across the boundary. The analysis of 
gbds in TEM is not easy, but information about them 
will eventually further our understanding of important 
boundary phenomena (e.g. migration of boundaries 
during recrystallization and grain growth, the sliding 
of grains during creep and superplastic flow and the 
way grain boundaries act as sources and sinks for point 
defects). 

4.4.2 Twin boundaries 

Annealing of cold-worked fee metals and alloys, such 
as copper, a-brass and austenitic stainless steels usu- 
ally causes many of the constituent crystals to form 
annealing twins. The lattice orientation changes at the 
twin boundary surface, producing a structure in which 
one part of the crystal or grain is the mirror-image of 
the other, the amount of atomic displacement being 
proportional to the distance from the twin boundary. 

The surfaces of a sample within and outside an 
annealing twin have different surface energies, because 
of their different lattice orientations, and hence respond 
quite differently when etched with a chemical etchant 
(Figure 4.26). In this diagram, twins 1 and 2 both 
have two straight parallel sides which are coherent 
low-energy interfaces. The short end face of twin 2 is 
non-coherent and therefore has a higher energy content 



Figure 4.26 Twinned regions within a single etched grain, 
produced by deformation and annealing. 
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Figure 4.27 Grain boundary /surf ace triple junction. 


per unit surface. Stacking faults are also coherent 
and low in energy content; consequently, because of 
this similarity in character, we find that crystalline 
materials which twin readily are also likely to contain 
many stacking faults (e.g. copper, a-brass). 

Although less important and less common than slip, 
another type of twinning can take place during plas- 
tic deformation. These so-called deformation twins 
sometimes form very easily, e.g. during mechanical 
polishing of metallographic samples of pure zinc; this 
process is discussed in Chapter 7. 

The free energies of interfaces can be determined 
from the equilibrium form of the triple junction where 
three interfaces, such as surfaces, grain boundaries 
or twins, meet. For the case of a grain boundary 
intersecting a free surface, shown in Figure 4.27, 


4.4. 3.2 Dissociation into Shockley partials 
The relationship between the two close-packed struc- 
tures cph and fee has been discussed in Chapter 2 
where it was seen that both structures may be built 
up from stacking close-packed planes of spheres. The 
shortest lattice vector in the fee structure joins a cube 
corner atom to a neighbouring face centre atom and 
defines the observed slip direction; one such slip vec- 
tor < 2 / 2 [ 101 ] is shown as b\ in Figure 4.28a which 
is for glide in the (111) plane. However, an atom 
which sits in a B position on top of the A plane would 
move most easily initially towards a C position and, 
consequently, to produce a macroscopical slip move- 
ment along [ 101 ] the atoms might be expected to 
take a zigzag path of the type B -> C -> B following 
the vectors b 2 = a/ 6 [ 2 ll] and b 3 = a/ 6[1 12 ] alter- 
nately. It will be evident, of course, that during the 
initial part of the slip process when the atoms change 
from B positions to C positions, a stacking fault in the 
( 111 ) layers is produced and the stacking sequence 
changes from ABC ABC ... to ABC AC ABC Dur- 
ing the second part of the slip process the correct 
stacking sequence is restored. 

To describe the atoms movement during slip, dis- 
cussed above, Heidenreich and Shockley have pointed 
out that the unit dislocation must dissociate into two 
half dislocations , 1 which for the case of glide in the 
( 111 ) plane would be according to the reaction: 


y gb = 2 y s cos 9/2 (4. 12) 

and hence y gh can be obtained by measuring the dihe- 
dral angle 0 and knowing y s . Similarly, measurements 
can be made of the ratio of twin boundary energy to 
the average grain boundary energy and, knowing either 
Ys or y g b gives an estimate of yy. 

4.4.3 Extended dislocations and stacking 
faults in close-packed crystals 

4.4.3. 1 Stacking faults 

Stacking faults associated with dislocations can be 
an extremely significant feature of the structure of 
many materials, particularly those with fee and cph 
structure. They arise because to a first approximation 
there is little to choose electrostatically between the 
stacking sequence of the close-packed planes in the 
fee metals ABCABC . . . and that in the cph metals 
ABABAB . . . Thus, in a metal like copper or gold, the 
atoms in a part of one of the close-packed layers may 
fall into the ‘wrong’ position relative to the atoms of 
the layers above and below, so that a mistake in the 
stacking sequence occurs (e.g. ABCBCABC . . .). Such 
an arrangement will be reasonably stable, but because 
some work will have to be done to produce it, stacking 
faults are more frequently found in deformed metals 
than annealed metals. 


a/2[l 01]^ a/6[2 1 1] + a/ 6[1 1 2] 

Such a dissociation process is (1) algebraically correct, 
since the sum of the Burgers vector components of the 
two partial dislocations, i.e. a/ 6[2 + 1 ], < 2 / 6 [1 + 1 ], 
a/ 6[1 + 2], are equal to the components of the Burg- 
ers vector of the unit dislocation, i.e. a/ 2 , 0 , a/ 2 , 
and ( 2 ) energetically favourable, since the sum of the 
strain energy values for the pair of half dislocations 
is less than the strain energy value of the single unit 
dislocation, where the initial dislocation energy is pro- 
portional to b\{= a 2 / 2) and the energy of the resultant 
partials to b\ + b\ = a 2 / 3. These half dislocations, or 
Shockley partial dislocations, repel each other by a 
force that is approximately F = (/xZ? 2 ^3 cos 60)/27iJ, 
and separate as shown in Figure 4.28b. A sheet of 
stacking faults is then formed in the slip plane between 
the partials, and it is the creation of this faulted region, 
which has a higher energy than the normal lattice, that 
prevents the partials from separating too far. Thus, if 
y J/m 2 is the energy per unit area of the fault, the force 
per unit length exerted on the dislocations by the fault 
is y N/m and the equilibrium separation d is given by 
equating the repulsive force F between the two half 
dislocations to the force exerted by the fault, y. The 
equilibrium separation of two partial dislocations is 

! The correct indices for the vectors involved in such 
dislocation reactions can be obtained from Figure 4.37. 
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Extended dislocation in (111) plane 

Figure 4.28 Schematic representation of slip in a (1 1 1) 
plane of a fee crystal. 


then given by 

pb 2 b 3 cos 60^ 
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from which it can be seen that the width of the stacking 
fault ‘ribbon’ is inversely proportional to the value of 
the stacking fault energy y and also depends on the 
value of the shear modulus /x. 

Figure 4.29a shows that the undissociated edge dis- 
location has its extra half-plane corrugated which may 
be considered as two (1 Ol) planes displaced relative 
to each other and labelled a and b in Figure 4.29b. On 
dissociation, planes a and b are separated by a region 
of crystal in which across the slip plane the atoms are 
in the wrong sites (see Figure 4.29c). Thus the high 
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Figure 4.29 Edge dislocation structure in the fee lattice, 
(a) and (b) undissociated, (c) and (d) dissociated: (a) and 
(c) are viewed normal to the {111) plane (from 
Hume-Rothery, Smallman and Haworth, 1969; courtesy of 
the Institute of Metals). 


strain energy along a line through the crystal associ- 
ated with an undissociated dislocation is spread over 
a plane in the crystal for a dissociated dislocation (see 
Figure 4.29d) thereby lowering its energy. 

A direct estimate of y can be made from the obser- 
vation of extended dislocations in the electron micro- 
scope and from observations on other stacking fault 
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defects (see Chapter 5). Such measurements show that 
the stacking fault energy for pure fee metals ranges 
from about 16 mJ/m 2 for silver to ~200 mJ/m 2 for 
nickel, with gold ~30, copper ~40 and aluminium 
135 mJ/m 2 , respectively. Since stacking faults are 
coherent interfaces or boundaries they have energies 
considerably lower than non-coherent interfaces such 
as free surfaces for which y s ~ pb/ 8 ~ 1.5 J/m 2 and 
grain boundaries for which y gb ~ y s /3 ~ 0.5 J/m 2 . 

The energy of a stacking fault can be estimated 
from twin boundary energies since a stacking fault 
ABCBCABC may be regarded as two overlapping 
twin boundaries CBC and BCB across which the next 
nearest neighbouring plane are wrongly stacked. In fee 
crystals any sequence of three atomic planes not in 
the ABC or CBA order is a stacking violation and 
is accompanied by an increased energy contribution. 
A twin has one pair of second nearest neighbour 
planes in the wrong sequence, two third neighbours, 
one fourth neighbour and so on; an intrinsic stacking 
fault two second nearest neighbours, three third and no 
fourth nearest neighbour violations. Thus, if next-next 
nearest neighbour interactions are considered to make 
a relatively small contribution to the energy then an 
approximate relation y ~ 2y T is expected. 

The frequency of occurrence of annealing twins gen- 
erally confirms the above classification of stacking 
fault energy and it is interesting to note that in alu- 
minium, a metal with a relatively high value of y, 
annealing twins are rarely, if ever, observed, while they 
are seen in copper which has a lower stacking fault 
energy. Electron microscope measurements of y show 
that the stacking fault energy is lowered by solid solu- 
tion alloying and is influenced by those factors which 
affect the limit of primary solubility. The reason for 
this is that on alloying, the free energies of the a-phase 
and its neighbouring phase become more nearly equal, 
i.e. the stability of the a-phase is decreased relative to 
some other phase, and hence can more readily tolerate 
mis-stacking. Figure 4.30 shows the reduction of y for 
copper with addition of solutes such as Zn, Al, Sn and 
Ge, and is consistent with the observation that anneal- 
ing twins occur more frequently in a-brass or Cu-Sn 
than pure copper. Substituting the appropriate values 
for p, a and y in equation (4.13) indicates that in silver 



Figure 4.30 Decrease in stacking-fault energy y for copper 
with alloying addition ( eja ). 


and copper the partials are separated to about 12 and 
6 atom spacings, respectively. For nickel the width is 
about 2b since although nickel has a high y its shear 
modulus is also very high. In contrast, aluminium has 
a lower y ~ 135 mJ/m 2 but also a considerably lower 
value for p and hence the partial separation is limited 
to about lb and may be considered to be unextended. 
Alloying significantly reduces y and very wide dislo- 
cations are produced, as found in the brasses, bronzes 
and austenitic stainless steels. However, no matter how 
narrow or how wide the partials are separated the two 
half-dislocations are bound together by the stacking 
fault, and consequently, they must move together as a 
unit across the slip plane. 

The width of the stacking fault ribbon is of impor- 
tance in many aspects of plasticity because at some 
stage of deformation it becomes necessary for disloca- 
tions to intersect each other; the difficulty which dislo- 
cations have in intersecting each other gives rise to one 
source of work-hardening. With extended dislocations 
the intersecting process is particularly difficult since 
the crossing of stacking faults would lead to a com- 
plex fault in the plane of intersection. The complexity 
may be reduced, however, if the half-dislocations coa- 
lesce at the crossing point, so that they intersect as 
perfect dislocations; the partials then are constricted 
together at their jogs, as shown in Figure 4.31a. 

The width of the stacking fault ribbon is also impor- 
tant to the phenomenon of cross-slip in which a 
dislocation changes from one slip plane to another 
intersecting slip plane. As discussed previously, for 
glide to occur the slip plane must contain both the 
Burgers vector and the line of the dislocation, and, 
consequently, for cross-slip to take place a dislocation 




Figure 4.31 (a) The crossing of extended dislocations, 
(b) various stages in the cross- slip of a dissociated screw 
dislocation. 
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must be in an exact screw orientation. If the dislo- 
cation is extended, however, the partials have first to 
be brought together to form an unextended dislocation 
as shown in Figure 4.31b before the dislocation can 
spread into the cross- slip plane. The constriction pro- 
cess will be aided by thermal activation and hence the 
cross-slip tendency increases with increasing temper- 
ature. The constriction process is also more difficult, 
the wider the separation of the partials. In aluminium, 
where the dislocations are relatively unextended, the 
frequent occurrence of cross-slip is expected, but for 
low stacking fault energy metals (e.g. copper or gold) 
the activation energy for the process will be high. Nev- 
ertheless, cross-slip may still occur in those regions 
where a high concentration of stress exists, as, for 
example, when dislocations pile up against some obsta- 
cle, where the width of the extended dislocation may 
be reduced below the equilibrium separation. Often 
screw dislocations escape from the piled-up group by 
cross-slipping but then after moving a certain distance 
in this cross-slip plane return to a plane parallel to the 
original slip plane because the resolved shear stress 
is higher. This is a common method of circumventing 
obstacles in the structure. 

4.4. 3. 3 Sessile dislocations 

The Shockley partial dislocation has its Burgers vec- 
tor lying in the plane of the fault and hence is glissile. 
Some dislocations, however, have their Burgers vector 
not lying in the plane of the fault with which they are 
associated, and are incapable of gliding, i.e. they are 
sessile. The simplest of these, the Frank sessile dislo- 
cation loop, is shown in Figure 4.32a. This dislocation 
is believed to form as a result of the collapse of the 
lattice surrounding a cavity which has been produced 
by the aggregation of vacancies on to a (1 1 1) plane. 
As shown in Figure 4.32a, if the vacancies aggregate 
on the central A-plane the adjoining parts of the neigh- 
bouring B and C planes collapse to fit in close-packed 
formation. The Burgers vector of the dislocation line 


bounding the collapsed sheet is normal to the plane 
with b = |[1 1 1], where a is the lattice parameter, and 
such a dislocation is sessile since it encloses an area 
of stacking fault which cannot move with the dislo- 
cation. A Frank sessile dislocation loop can also be 
produced by inserting an extra layer of atoms between 
two normal planes of atoms, as occurs when interstitial 
atoms aggregate following high energy particle irradi- 
ation. For the loop formed from vacancies the stacking 
sequence changes from the normal ABCABCA ... to 
ABCBCA . . ., whereas inserting a layer of atoms, e.g. 
an A-layer between B and C, the sequence becomes 

ABC ABAC A The former type of fault with one 

violation in the stacking sequence is called an intrin- 
sic fault, the latter with two violations is called an 
extrinsic fault. The stacking sequence violations are 
conveniently shown by using the symbol A to denote 
any normal stacking sequence AB, BC, CA but y for 
the reverse sequence AC, CB, BA. The normal fee 
stacking sequence is then given by A A A A . . ., the 
intrinsic fault by A A y A A . . . and the extrinsic fault 

by A A y y A A The reader may verify that the 

fault discussed in the previous Section is also an intrin- 
sic fault, and that a series of intrinsic stacking faults 
on neighbouring planes gives rise to a twinned struc- 
ture ABCABACBA or AAAAyyyyA. Elec- 
tron micrographs of Frank sessile dislocation loops are 
shown in Figures 4.38 and 4.39. 

Another common obstacle is that formed between 
extended dislocations on intersecting {111} slip 
planes, as shown in Figure 4.32b. Here, the 
combination of the leading partial dislocation lying in 
the (111) plane with that which lies in the (111) 
plane forms another partial dislocation, often referred 
to as a ‘stair-rod’ dislocation, at the junction of the 
two stacking fault ribbons by the reaction 

|[T 1 2] + | [2 IT] §[101] 

The indices for this reaction can be obtained from 
Figure 4.37 and it is seen that there is a reduction in 




Figure 4.32 Sessile dislocations: (a) a Frank sessile dislocation; (b) stair-rod dislocation as part of a Lomer-Cottrell barrier. 
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energy from [(a 2 / 6) + {a 2 / 6)] to a 2 1 18. This trian- 
gular group of partial dislocations, which bounds the 
wedge-shaped stacking fault ribbon lying in a (101) 
direction, is obviously incapable of gliding and such 
an obstacle, first considered by Lomer and Cottrell, 
is known as a Lomer-Cottrell barrier. Such a barrier 
impedes the motion of dislocations and leads to work- 
hardening, as discussed in Chapter 7. 

4.4. 3. 4 Stacking faults in ceramics 

Some ceramic oxides may be described in terms of 
fee or eph packing of the oxygen anions with the 
cations occupying the tetrahedral or octahedral intersti- 
tial sites, and these are more likely to contain stacking 
faults. Sapphire, a-Al 2 0 3 , deforms at high tempera- 
tures on the (0001) (1 120) basal systems and dis- 
sociated f (1 0 1 0) + | (0 1 1 0) dislocations have been 
observed. Stacking faults also occur in spinels. Stoi- 
chiometric spinel (MgAl 2 0 4 or MgO. nAl 2 0 3 , n = 1) 
deforms predominantly on the {1 1 1 } (1 10) slip system 
at high temperature (~1800°C) with dissociated dislo- 
cations. Non- stoichiometric crystals {n > 1) deform at 
lower temperatures when the { 1 1 0} (1 10) secondary 
slip system is also preferred. The stacking fault energy 
decreases with deviation from stoichiometry from a 
value around 0.2 J/m 2 for n = 1 crystals to around 
0.02 J/m 2 for n = 3.5. 

4.4.3.5 Stacking faults in semiconductors 
Elemental semiconductors Si, Ge with the diamond 
cubic structure or III-V compounds InSb with the spha- 
lerite (zinc blende) structure have perfect dislocation 
Burgers vectors similar to those in the fee lattice. 
Stacking faults on {1 1 1} planes associated with partial 
dislocations also exist. The {111} planes are stacked in 
the sequence AaBbCcAaBb as shown in Figure 4.33 
and stacking faults are created by the removal (intrin- 
sic) or insertion (extrinsic) of pairs of layers such 
as Aa or Bb. These faults do not change the four 
nearest-neighbour covalent bonds and are low-energy 


[ 112 ] 



Figure 4.33 A diamond cubic lattice projected normal to 
(1 1 0 ). O represents atoms in the plane of the paper and + 
represents atoms in the plane below. {Ill) is 
perpendicular to the plane of the paper and appears as a 
horizontal trace. 


faults ~50 mJ/m 2 . Dislocations could slip between 
the narrowly spaced planes Ba, called the glide set, 
or between the widely spaced planes bB, called the 
shuffle set, but weak beam microscopy shows dis- 
sociation into Shockley partials occurs on the glide 
set. A 60° dislocation (i.e. 60° to its Burgers vec- 
tor \a{\ 10)) of the glide set is formed by cutting 
out material bounded by the surface 1564 and then 
joining the cut together. The extra plane of atoms 
terminates between a and B leaving a row of dan- 
gling bonds along its core which leads to the electrical 
effect of a half-filled band in the band gap; plastic 
deformation can make n-type Ge into /7-type. The 60° 
dislocation BC and its dissociation into <5C and B 8 
is shown in Figure 4.34. The wurtzite (ZnS) structure 

has the hexagonal stacking sequence AaBbAaBb 

Similarly, stacking faults in the wurtzite structure are 
thin layers of sphalerite BbAaBbCcAA . . . analogous 
to stacking faults in hexagonal metals. 




Figure 4.34 (a) The 60° dislocation BC, (b) the dissociation of BC into SC and BS. 
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4.5 Volume defects 

4.5.1 Void formation and annealing 

Defects which occupy a volume within the crystal 
may take the form of voids, gas bubbles and cavities. 
These defects may form by heat-treatment, irradia- 
tion or deformation and their energy is derived largely 
from the surface energy (1-3 J/m 2 ). In some materi- 
als with low stacking fault energy a special type of 
three-dimensional defect is formed, namely the defect 
tetrahedron. This consists of a tetrahedron made up 
from stacking faults on the four {111} planes joined 
together by six low-energy stair-rod dislocations. This 
defect is discussed more fully in Section 4.6.2. 3. 

The growth of the original vacancy cluster into 
a three-dimensional aggregate, i.e. void, a collapsed 
vacancy disc, i.e. dislocation loop, should, in principle, 
depend on the relative surface to strain energy values 
for the respective defects. The energy of a three- 
dimensional void is mainly surface energy, whereas 
that of a Frank loop is mainly strain energy at small 
sizes. However, without a detailed knowledge of the 
surface energy of small voids and the core-energy of 
dislocations it is impossible to calculate, with any 
degree of confidence, the relative stability of these 
clustered vacancy defects. 

The clustering of vacancies to form voids has now 
been observed in a number of metals with either 
fee or eph structure. In as-quenched specimens the 
voids are not spherical but bounded by crystallographic 
faces (see Figure 4.58) and usually are about 50 nm 
radius in size. In fee metals they are octahedral in 
shape with sides along (1 10), sometimes truncated by 
{100} planes, and in eph metals bounded by prism 
and pyramidal planes. Void formation is favoured by 
slow quenching rates and high ageing temperatures, 
and the density of voids increases when gas is present 
in solid solution (e.g. hydrogen in copper, and either 
hydrogen or oxygen in silver). In aluminium and mag- 
nesium, void formation is favoured by quenching from 
a wet atmosphere, probably as a result of hydrogen 
production due to the oxidation reactions. It has been 
postulated that small clustered vacancy groups are sta- 
bilized by the presence of gas atoms and prevented 
from collapsing to a planar disc, so that some critical 
size for collapse can be exceeded. The voids are not 
conventional gas bubbles, however, since only a few 
gas atoms are required to nucleate the void after which 
it grows by vacancy adsorption. 

4.5.2 Irradiation and voiding 

Irradiation produces both interstitials and vacancies in 
excess of the equilibrium concentration. Both species 
initially cluster to form dislocation loops, but it is the 
interstitial loops formed from clustering of interstitials 
which eventually develop into a dislocation structure. 

In general, interstitial loops grow during irradiation 
because the large elastic misfit associated with an 
interstitial causes dislocations to attract interstitials 


more strongly than vacancies. Interstitial loops are 
therefore intrinsically stable defects, whereas vacancy 
loops are basically unstable defects during irradiation. 
Thus interstitials attracted to a vacancy loop, i.e. a 
loop formed by clustering vacancies, will cause it to 
shrink as the interstitials are annihilated. Increasing the 
irradiation temperature results in vacancies aggregating 
to form voids. Voids are formed in an intermediate 
temperature range ~ 0.3 to 0.6 T m , above that for long- 
range single vacancy migration and below that for 
thermal vacancy emission from voids. To create the 
excess vacancy concentration it is also necessary to 
build up a critical dislocation density from loop growth 
to bias the interstitial flow. 

There are two important factors contributing to void 
formation. The first is the degree of bias the disloca- 
tion density (developed from the growth of interstitial 
loops) has for attracting interstitials, which suppresses 
the interstitial content compared to vacancies. The 
second factor is the important role played in void 
nucleation by gases, both surface-active gases such 
as oxygen, nitrogen and hydrogen frequently present 
as residual impurities, and inert gases such as helium 
which may be generated continuously during irradia- 
tion due to transmutation reactions. The surface- active 
gases such as oxygen in copper can migrate to embryo 
vacancy clusters and reduce the surface energy. The 
inert gas atoms can acquire vacancies to become gas 
molecules inside voids (when the gas pressure is not in 
equilibrium with the void surface tension) or gas bub- 
bles when the gas pressure is considerable (P^2y s /r). 
Voids and bubbles can give rise to irradiation swelling 
and embrittlement of materials. 

4.5.3 Voiding and fracture 

The formation of voids is an important feature in 
the ductile failure of materials. The fracture process 
involves three stages. First, small holes or cavities 
nucleate usually at weak internal interfaces (e.g. par- 
ticle/matrix interfaces). These cavities then expand by 
plastic deformation and finally coalesce by localized 
necking of the metal between adjacent cavities to form 
a fibrous fracture. A scanning electron micrograph 
showing the characteristics of a typical ductile failure 
is shown in Figure 4.35. This type of fracture may be 
regarded as taking place by the nucleation of an inter- 
nal plastic cavity, rather than a crack, which grows 
outwards to meet the external neck which is growing 
inwards. Experimental evidence suggests that nucle- 
ation occurs at foreign particles. For example, OFHC 
copper necks down to over 90% reduction in area, 
whereas tough-pitch copper shows only 70% reduction 
in area; a similar behaviour is noted for super-pure and 
commercial purity aluminium. Thus if no inclusions 
were present, failure should occur by the specimen 
pulling apart entirely by the inward growth of the 
external neck, giving nearly 100% reduction in area. 
Dispersion-hardened materials in general fail with a 
ductile fracture, the fibrous region often consisting 
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Figure 4.35 SEM micrograph of a medium-carbon (0.4%) 
steel with a quenched and tempered martensite structure, 
showing large dimples associated with oxide inclusions and 
small dimples associated with small carbide precipitates 
(courtesy Dr L. Sidjanin). 


of many dimples arising from the dispersed particles 
nucleating holes and causing local ductile failure. Duc- 
tile failure is discussed further in Chapter 8. 


4.6 Defect behaviour in some real 
materials 

4.6.1 Dislocation vector diagrams and the 
Thompson tetrahedron 

The classification of defects into point, line, planar 
and volume is somewhat restrictive in presenting an 
overview of defect behaviour in materials, since it is 
clear, even from the discussion so far, that these defects 
are interrelated and interdependent. In the following 


sections these features will be brought out as well as 
those which relate to specific structures. 

In dealing with dislocation interactions and defects 
in real material it is often convenient to work with 
a vector notation rather than use the more conven- 
tional Miller indices notation. This may be illustrated 
by reference to the fee structure and the Thompson 
tetrahedron. 

All the dislocations common to the fee structure, 
discussed in the previous sections, can be represented 
conveniently by means of the Thompson reference 
tetrahedron (Figure 4.36a), formed by joining the three 
nearest face-centring atoms to the origin D. Here 
ABCD is made up of four {111} planes (1 1 1), (IT 1), 
(111) and (111) as shown by the stereogram given 
in Figure 4.36b, and the edges AB, BC, CA . . . corre- 
spond to the (110) directions in these planes. Then, 
if the mid-points of the faces are labelled a , f, y, 8 , 
as shown in Figure 4.37a, all the dislocation Burgers 
vectors are represented. Thus, the edges (AB, BC . . .) 
correspond to the normal slip vectors, a/ 2(1 10). The 
half-dislocations, or Shockley partials, into which these 
are dissociated have Burgers vectors of the a/6{l 1 2) 
type and are represented by the Roman- Greek sym- 
bols Ay, By, Dy, A 8, B5, etc, or Greek-Roman sym- 
bols yA, yB, yD, 8 A, 5B, etc. The dissociation reaction 
given in the first reaction in Section 4.4. 3.2 is then 
simply written 

BC -> B8 + <5C 

and there are six such dissociation reactions in each of 
the four {111} planes (see Figure 4.37). It is conven- 
tional to view the slip plane from outside the tetrahe- 
dron along the positive direction of the unit dislocation 
BC, and on dissociation to produce an intrinsic stack- 
ing fault arrangement; the Roman -Greek partial B8 
is on the right and the Greek-Roman partial 8C on 
the left. A screw dislocation with Burgers vector BC 
which is normally dissociated in the 5-plane is capable 
of cross-slipping into the a-plane by first constricting 
B8 + 8C -> BC and then redissociating in the a-plane 
BC -> Ba + aC. 


(a) | z 



(b) 



Figure 4.36 (a) Construction and (b) orientation of the Thompson tetrahedron ABCD. The slip directions in a given {111} 
plane may be obtained from the trace of that plane as shown for the (111) plane in (b). 
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Figure 4.37 A Thompson tetrahedron (a) closed and 
(b) opened out. In (b) the notation [110) is used in place of 
the usual notation [1 1 0] to indicate the sense of the vector 
direction. 

4.6.2 Dislocations and stacking faults in fee 
structures 

4.6.2. 1 Frank loops 

A powerful illustration of the use of the Thompson 
tetrahedron can be made if we look at simple Frank 
loops in fee metals (see Figure 4.32a). The Frank par- 
tial dislocation has a Burgers vector perpendicular to 
the (111) plane on which it lies and is represented 
by Aa, B/3, C y, D5, a A, etc. Such loops shown in 
the electron micrograph of Figure 4.38 have been pro- 
duced in aluminium by quenching from about 600° C. 
Each loop arises from the clustering of vacancies into 
a disc- shaped cavity which then form a dislocation 
loop. To reduce their energy, the loops take up reg- 
ular crystallographic forms with their edges parallel to 
the (1 1 0) directions in the loop plane. Along a (1 1 0) 
direction it can reduce its energy by dissociating on 
an intersecting {111} plane, forming a stair-rod at the 
junction of the two {111} planes, e.g. Aa -> A8 + 8a 
when the Frank dislocation lies along [101] common 
to both a- and 5-planes. 

Some of the loops shown in Figure 4.38 are not 
Frank sessile dislocations as expected, but prismatic 
dislocations, since no contrast of the type arising from 
stacking faults, can be seen within the defects. The 
fault will be removed by shear if it has a high stacking 
fault energy thereby changing the sessile Frank loop 
into a glissile prismatic loop according to the reaction 

a/ 3[1 1 1] + a/ 6[1 1 2] -* a/ 2[1 1 0] 

Stressing the foil while it is under observation in 
the microscope allows the unfaulting process to be 



Figure 4.38 Single-faulted, double-faulted (A) and unfaulted 
(B) dislocation loops in quenched aluminium ( after Edington 
and Smallman, 1965; courtesy of Taylor and Francis). 

observed directly (see Figure 4.39). This reaction is 
more easily followed with the aid of the Thompson 
tetrahedron and rewritten as 

D5 + 5C -> DC 

Physically, this means that the disc of vacancies aggre- 
gated on a (111) plane of a metal with high stack- 
ing fault energy, besides collapsing, also undergoes 
a shear movement. The dislocation loops shown in 
Figure 4.39b are therefore unit dislocations with their 
Burgers vector a/2[l 10] inclined at an angle to the 
original (111) plane. A prismatic dislocation loop lies 
on the surface of a cylinder, the cross-section of which 
is determined by the dislocation loop, and the axis of 
which is parallel to the [1 1 0] direction. Such a dislo- 
cation is not sessile, and under the action of a shear 
stress it is capable of movement by prismatic slip in 
the [1 1 0] direction. 

Many of the large Frank loops in Figure 4.38 
(for example, marked A) contain additional triangular- 
shaped loop contrast within the outer hexagonal loop. 



(a) (b) 


Figure 4.39 Removal of the stacking fault from a Frank 
sessile dislocation by stress (after Goodhew and Smallman ). 
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Figure 4.40 The structure of a double dislocation loop in 
quenched aluminium ( after Edington and Smallman, 1965; 
courtesy of Taylor and Francis ). 

The stacking fault fringes within the triangle are usu- 
ally displaced relative to those between the triangle and 
the hexagon by half the fringe spacing, which is the 
contrast expected from overlapping intrinsic stacking 
faults. The structural arrangement of those double- 
faulted loops is shown schematically in Figure 4.40, 
from which it can be seen that two intrinsic faults 
on next neighbouring planes are equivalent to an 
extrinsic fault. The observation of double-faulted loops 
in aluminium indicates that it is energetically more 
favourable to nucleate a Frank sessile loop on an exist- 
ing intrinsic fault than randomly in the perfect lattice, 
and it therefore follows that the energy of a double or 
extrinsic fault is less than twice that of the intrinsic 
fault, i.e. ye < 2y I . The double loops marked B have 
the outer intrinsic fault removed by stress. 

The addition of a third overlapping intrinsic fault 
would change the stacking sequence from the per- 
fect ABCABCABC to ABC f B f A f CABC, where 
the arrows indicate missing planes of atoms, and pro- 
duce a coherent twinned structure with two coherent 
twin boundaries. This structure would be energetically 
favourable to form, since y Wm < y\ < y E . It is pos- 
sible, however, to reduce the energy of the crystal 
even further by aggregating the third layer of vacan- 
cies between the two previously-formed neighbouring 
intrinsic faults to change the structure from an extrin- 
sically faulted ABC f B f ABC to perfect ABC f f f 
ABC structure. Such a triple-layer dislocation loop is 
shown in Figure 4.41. 

4. 6. 2. 2 Stair-rod dislocations 
The stair-rod dislocation formed at the apex of a 
Lomer-Cottrell barrier can also be represented by the 
Thompson notation. As an example, let us take the 
interaction between dislocations on the 8- and em- 
planes. Two unit dislocations BA and DB, respectively, 
are dissociated according to 

BA -> B5 + 8 A (on the 5-plane) 

and DB Da + cmB (on the cm-plane) 

and when the two Shockley partials cmB and B 8 inter- 
act, a stair-rod dislocation a8 = a/ 6[1 0 1] is formed. 
This low-energy dislocation is pure edge and there- 
fore sessile. If the other pair of partials interact then 
the resultant Burgers vector is (5 A + Dcm) = a/3[l 0 1] 


I — p/2 I 

\ 

% 

(a) 

Figure 4.41 Triple-loop and Frank sessile loop in Al-0.65% 
Mg (after Kritzinger, Smallman and Dobson, 1969; courtesy 
of P erg amon Press). 

and of higher energy. This vector is written in Thomp- 
son’ s notation as 5D/Acm and is a vector equal to twice 
the length joining the midpoints of 8 A and Dcm. 

4.6. 2. 3 Stacking-fault tetrahedra 
In fee metals and alloys, the vacancies may also clus- 
ter into a three-dimensional defect, forming a tetra- 
hedral arrangement of stacking faults on the four 
{111} planes with the six (11 0) edges of the tetrahe- 
dron, where the stacking faults bend from one {111} 
plane to another, consisting of stair-rod dislocations. 
The crystal structure is perfect inside and outside the 
tetrahedron, and the three-dimensional array of faults 
exhibits characteristic projected shape and contrast 
when seen in transmission electron micrographs as 
shown in Figure 4.44. This defect was observed orig- 
inally in quenched gold but occurs in other materials 
with low stacking-fault energy. One mechanism for the 
formation of the defect tetrahedron by the dissociation 
of a Frank dislocation loop (see Figure 4.42) was first 
explained by Silcox and Hirsch. The Frank partial dis- 
location bounding a stacking fault has, because of its 
large Burgers vector, a high strain energy, and hence 
can lower its energy by dissociation according to a 
reaction of the type 

a/ 3[1 1 1 ] -> a/ 6[1 21 ]+ a/ 6[1 0 1 ] 

(I) (s) +) 

The figures underneath the reaction represent the ener- 
gies of the dislocations, since they are proportional to 
the squares of the Burgers vectors. This reaction is, 
therefore, energetically favourable. This reaction can 
be seen with the aid of the Thompson tetrahedron, 
which shows that the Frank partial dislocation Aa can 
dissociate into a Shockley partial dislocation (A/3, A8 
or Ay) and a low energy stair-rod dislocation (/km, 8a 
or ya) for example Aa — ► Ay + ya. 
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dislocations, and (c) arrangement of the six stair-rod dislocations. 


The formation of the defect tetrahedron of stacking 
faults may be envisaged as follows. The collapse of 
a vacancy disc will, in the first instance, lead to the 
formation of a Frank sessile loop bounding a stacking 
fault, with edges parallel to the (110) directions. 
Each side of the loop then dissociates according to 
the above reaction into the appropriate stair-rod and 
partial dislocations, and, as shown in Figure 4.42a, the 
Shockley dislocations formed by dissociation will lie 
on intersecting {111} planes, above and below the 
plane of the hexagonal loop; the decrease in energy 
accompanying the dissociation will give rise to forces 
which tend to pull any rounded part of the loop 
into (1 10). Moreover, because the loop will not in 
general be a regular hexagon, the short sides will be 
eliminated by the preferential addition of vacancies at 
the constricted site, and a triangular- shaped loop will 
form (Figure 4.42b). The partials A/3, Ay and A8 bow 
out on their slip plane as they are repelled by the stair- 
rods. Taking into account the fact that adjacent ends 
of the bowing loops are of opposite sign, the partials 
attract each other in pairs to form stair-rod dislocations 
along DA, BA and CA , according to the reactions 

yA+Af -> yB, 8A+Ay -> 8y, fA+A8 -> /38 
In vector notation the reactions are of the type 

a/ 6[II2] + a/ 6[l2l] -* a/ 6[0 1 T] 

(I) (s) (I) 

(the reader may deduce the appropriate indices from 
Figure 4.37), and from the addition of the squares of 
the Burgers vectors underneath it is clear that this reac- 
tion is also energetically favourable. The final defect 
will therefore be a tetrahedron made up from the inter- 
section of stacking faults on the four {111} planes, so 
that the (110) edges of the tetrahedron will consist of 
low-energy stair-rod dislocations (Figure 4.42c). 

The tetrahedron of stacking faults formed by the 
above sequence of events is essentially symmetrical, 
and the same configuration would have been obtained 
if collapse had taken place originally on any other 
(111) plane. The energy of the system of stair-rod 
dislocations in the final configuration is proportional 


to 6 x jg = | , compared with 3 x | = 1 for the orig- 
inal stacking fault triangle bounded by Frank partials. 
Considering the dislocation energies alone, the disso- 
ciation leads to a lowering of energy to one-third of 
the original value. However, three additional stacking 
fault areas, with energies of y per unit area, have been 
newly created and if there is to be no net rise in energy 
these areas will impose an upper limit on the size of 
the tetrahedron formed. The student may wish to ver- 
ify that a calculation of this maximum size shows the 
side of the tetrahedron should be around 50 nm. 

De Jong and Koehler have proposed that the tetra- 
hedra may also form by the nucleation and growth 
of a three-dimensional vacancy cluster. The smallest 
cluster that is able to collapse to a tetrahedron and 
subsequently grow by the absorption of vacancies is a 
hexa-vacancy cluster. Growth would then occur by the 
nucleation and propagation of jog lines across the faces 
of the tetrahedron, as shown in Figure 4.43. The hexa- 
vacancy cluster may form by clustering di-vacancies 
and is aided by impurities which have excess positive 
change relative to the matrix (e.g. Mg, Cd or A1 in 
Au). Hydrogen in solution is also a potent nucleating 
agent because the di- vacancy/proton complex is mobile 
and attracted to ‘free’ di-vacancies. Figure 4.44 shows 
the increase in tetrahedra nucleation after preannealing 
gold in hydrogen. 

4.6.3 Dislocations and stacking faults in cph 
structures 

In a cph structure with axial ratio c/a, the most closely 
packed plane of atoms is the basal plane (0001) 



Figure 4.43 Jog line forming a ledge on the face of a 
tetrahedron. 
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Figure 4.44 Tetrahedra in gold (a) quenched and 

(b) preannealed in H 2 -N 2 gas ( after Johnston, Dobson and 

Smallman). 

and the most closely packed directions (1 12 0). The 
smallest unit lattice vector is a, but to indicate the 
direction of the vector (u, v, w) in Miller- Bravais 
indices it is written as a/ 3(1 120) where the mag- 
nitude of the vector in terms of the lattice param- 
eters is given by a[3(u 2 + uv + v 2 ) + ( c/a) 2 w 2 ] l/ 2 . 
The usual slip dislocation therefore has a Burgers 
vector a 1 3(1 120) and glides in the (0001) plane. 
This slip vector is (a/3, a/3, 2(a/3), 0) and has no 




Figure 4.45 Burgers vectors in the cph lattice ( after 
Berghezan, Fourdeux and Amelinckx, 1961; courtesy of 
Pergamon Press). 

component along the c-axis and so can be writ- 
ten without difficulty as a/ 3(1 120). However, when 
the vector has a component along the c-axis, as for 
example (a/3, a/3, 2 (a/ 3), 3c), difficulty arises and 
to avoid confusion the vectors are referred to unit 
distances ( a , a, a, c ) along the respective axes (e.g. 
1/3(1 12 0) and 1/3(1 12 3)). Other dislocations can 
be represented in a notation similar to that for the fee 
structure, but using a double-tetrahedron or bipyramid 
instead of the single tetrahedron previously adopted, 
as shown in Figure 4.45. An examination leads to the 
following simple types of dislocation: 

1. Six perfect dislocations with Burgers vectors in the 
basal plane along the sides of the triangular base 
ABC. They are AB, BC, CA, BA, CB and AC and 
are denoted by a or 1/3(1 120). 

2. Six partial dislocations with Burgers vectors in the 
basal plane represented by the vectors Act, Bcr, C o 
and their negatives. These dislocations arise from 
dissociation reactions of the type 

AB — ► A a + <tB 

and may also be written as p or | (1 0 1 0). 

3. Two perfect dislocations perpendicular to the basal 
plane represented by the vectors ST and TS of 
magnitude equal to the cell height c or (0001). 

4. Partial dislocations perpendicular to the basal plane 
represented by the vectors <rS, oT, S<r, T a of 
magnitude c/ 2 or | (0 0 0 1 ) . 

5. Twelve perfect dislocations of the type |(1 12 3) 
with a Burgers vector represented by SA/TB which 
is a vector equal to twice the join of the mid-points 
of SA and TB. These dislocations are more simply 
referred to as (c + a) dislocations. 

6. Twelve partial dislocations, which are a combina- 
tion of the partial basal and non-basal dislocations, 
and represented by vectors AS, BS, CS, AT, BT 
and CT or simply (c/2) + p equal to ^(2023). 
Although these vectors represent a displacement 
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Table 4.2 Dislocations in structures 


Type 

AB, BC 

Ao, Bo 

ST, TS 

oS, oT 

AS, BS 

SA/TB 

Vector 

1(1120) 

l(iolo) 

(0001) 

1(0001) 

1(2023) 

1(1123) 

Energy 

a 2 

a 2 / 3 

c 2 = 8o 2 /3 

2<3 2 /3 

« 2 

lla 2 /3 


from one atomic site to another the resultant dis- 
locations are imperfect because the two sites are 
not identical. 

The energies of the different dislocations are given 
in a relative scale in Table 4.2 assuming cl a is ideal. 

There are many similarities between the dislocations 
in the cph and fee structure and thus it is not necessary 
to discuss them in great detail. It is, however, of 
interest to consider the two basic processes of glide 
and climb. 

4.6.3. 1 Dislocation glide 

A perfect slip dislocation in the basal plane AB = 
| [1 2 1 0] may dissociate into two Shockley partial 
dislocations separating a ribbon of intrinsic stacking 
fault which violates the two next-nearest neighbours in 
the stacking sequence. There are actually two possible 
slip sequences: either a B -layer slides over an A-layer, 
i.e. Act followed by <tB (see Figure 4.46a) or an A- 
layer slides over a B -layer by the passage of a ctB 
partial followed by an A a (see Figure 4.46b). The 
dissociation given by 

AB — ► Act + ctB 

may be written in Miller-Bravais indices as 

i[l2l0] | [0 1 1 0] + |[T 100] 


This reaction is similar to that in the fee lattice and 
the width of the ribbon is again inversely proportional 
to the stacking fault energy y. Dislocations dissociated 
in the basal planes have been observed in cobalt, which 
undergoes a phase transformation and for which y 
is considered to be low (~25 mJ/m 2 ). For the other 
common cph metals Zn, Cd, Mg, Ti, Be, etc. y is 
high (250-300 mJ/m 2 ). No measurements of intrinsic 
faults with two next-nearest neighbour violations have 
been made, but intrinsic faults with one next-nearest 
neighbour violation have been measured and show 
that Mg ~ 125 ml/m 2 , Zn ~ 140 ml/m 2 , and Cd ~ 
150-175 mJ/m 2 . It is thus reasonable to conclude 
that intrinsic faults associated with Shockley partials 
have somewhat higher energy. Dislocations in these 
metals are therefore not very widely dissociated. A 
screw dislocation lying along a [1210] direction is 
capable of gliding in three different glide planes but 
the small extension in the basal plane will be sufficient 
to make basal glide easier than in either the pyramidal 
(1 Oil) or prismatic (1 OlO) glide (see Figure 7.19). 
Pyramidal and prismatic glide will be more favoured 
at high temperatures in metals with high stacking-fault 
energy when thermal activation aids the constriction 
of the dissociated dislocations. 

4. 6. 3. 2 Dislocation climb 

Stacking faults may be produced in hexagonal lattices 
by the aggregation of point defects. If vacancies 
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Figure 4.46 Stacking faults in the cph lattice (after Partridge, 1967; by courtesy of the American Society for Metals). 
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aggregate as a platelet, as shown in Figure 4.46c, 
the resultant collapse of the disc-shaped cavity 
(Figure 4.46d) would bring two similar layers into 
contact. This is a situation incompatible with the close- 
packing and suggests that simple Frank dislocations 
are energetically unfavourable in cph lattices. This 
unfavourable situation can be removed by either one 
of two mechanisms as shown in Figures 4.46e and 
4.46f. In Figure 4.46e the B -layer is converted to a 
C-position by passing a pair of equal and opposite 
partial dislocations (dipole) over adjacent slip planes. 
The Burgers vector of the dislocation loop will be 
of the aS type and the energy of the fault, which is 
extrinsic, will be high because of the three next nearest 
neighbour violations. In Figure 4.46f the loop is swept 
by a Aa-type partial dislocation which changes the 
stacking of all the layers above the loop according 
to the rule A —> B C —> A. The Burgers vector of 
the loop is of the type AS, and from the dislocation 
reaction A a + crS -> AS or | [1 0 1 0] + \ [0 0 0 1] -> 
| [2 02 3] and the associated stacking fault, which is 
intrinsic, will have a lower energy because there is only 
one next-nearest neighbour violation in the stacking 
sequence. Faulted loops with b = AS or (\c + p) have 




2p 


(a) 


(c) 


been observed in Zn, Mg and Cd (see Figure 4.47). 
Double-dislocation loops have also been observed 
when the inner dislocation loop encloses a central 
region of perfect crystal and the outer loop an annulus 
of stacking fault. The structure of such a double loop is 
shown in Figure 4.48. The vacancy loops on adjacent 
atomic planes are bounded by dislocations with non- 
parallel Burgers vectors, i.e. b = (\c + p) and b = 
(|c — p), respectively; the shear component of the 
second loop acts in such a direction as to eliminate the 
fault introduced by the first loop. There are six partial 
vectors in the basal plane pi, p 2 , P 3 and the negatives, 
and if one side of the loop is sheared by either p x , p 2 or 
p 2 the stacking sequence is changed according to A -> 
B — > C -> A, whereas reverse shearing A -> C -> 
B — > A, results from either —p x , —p 2 or —p 2 . It is 
clear that the fault introduced by a positive partial shear 
can be eliminated by a subsequent shear brought about 
by any of the three negative parti als. Three, four and 
more layered loops have also been observed in addition 
to the more common double loop. The addition of each 
layer of vacancies alternately introduces or removes 
stacking-faults, no matter whether the loops precipitate 
one above the other or on opposite sides of the original 
defect. 


(b) 


(d) 


Figure 4.47 Growth of single- and double-faulted loops in magnesium on annealing at 175° C for (a) t = 0 min, 
(b) t = 5 min, (c) t = 15 min and (d) t = 25 min (after Hales, Smallman and Dobson ). 



112 Modern Physical Metallurgy and Materials Engineering 



Figure 4.48 Structure of double-dislocation loop in cph 
lattice. 
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Figure 4.49 Dislocation loop formed by aggregation of 
interstitials in a cph lattice with (a) high-energy and 
(b) low-energy stacking fault. 

As in fee metals, interstitials may be aggregated into 
platelets on close-packed planes and the resultant struc- 
ture, shown in Figure 4.49a, is a dislocation loop with 
Burgers vector Sa, containing a high-energy stacking 
fault. This high-energy fault can be changed to one 
with lower energy by having the loop swept by a par- 
tial as shown in Figure 4.49b. 

All these faulted dislocation loops are capable of 
climbing by the addition or removal of point defects 
to the dislocation line. The shrinkage and growth of 
vacancy loops has been studied in some detail in Zn, 
Mg and Cd and examples, together with the climb 
analysis, are discussed in Section 4.7.1. 

4.6.4 Dislocations and stacking faults in bcc 
structures 

The shortest lattice vector in the bcc lattice is 
< 2 / 2 [l 1 1], which joins an atom at a cube corner to the 
one at the centre of the cube; this is the observed slip 
direction. The slip plane most commonly observed is 
(110) which, as shown in Figure 4.50, has a distorted 
close-packed structure. The (110) planes are packed 



Figure 4.50 The (110) plane of the bcc lattice ( after 
Weertman; by courtesy of Collier-Macmillan International ). 


in an AB ABAB sequence and three {110} type planes 
intersect along a (111) direction. It therefore follows 
that screw dislocations are capable of moving in any 
of the three {110} planes and for this reason the slip 
lines are often wavy and ill-defined. By analogy with 
the fee structure it is seen that in moving the B -layer 
along the [ill] direction it is easier to shear in the 
directions indicated by the three vectors b\, and 
b'i. These three vectors define a possible dissociation 
reaction 

a — a — a — a — 

-[ 111 ]^ -[1 1 0 ] + -[1 1 2 ] + -[1 1 0 ] 

The stacking fault energy of pure bcc metals is con- 
sidered to be very high, however, and hence no faults 
have been observed directly. Because of the stacking 
sequence ABABAB of the (110) planes the formation 
of a Frank partial dislocation in the bcc structure gives 
rise to a situation similar to that for the cph structure, 
i.e. the aggregation of vacancies or interstitials will 
bring either two A-layers or two B -layers into contact 
with each other. The correct stacking sequence can 
be restored by shearing the planes to produce perfect 
dislocations «/2[l 1 1] or a/2[l 11]. 

Slip has also been observed on planes indexed as 
(112) and (12 3) planes, and although some workers 
attribute this latter observation to varying amounts of 
slip on different (110) planes, there is evidence to 
indicate that (112) and (1 2 3) are definite slip planes. 
The packing of atoms in a (1 1 2) plane conforms to a 
rectangular pattern, the rows and columns parallel to 
the [1 1 0] and [111] directions, respectively, with the 
closest distance of approach along the [111] direc- 
tion. The stacking sequence of the (112) planes is 
ABCDEFAB . . . and the spacing between the planes 
6. It has often been suggested that the unit dislo- 
cation can dissociate in the (11 2) plane according to 
the reaction 

Cl — Cl — Cl — 

-[ 111 ]^ -[ 111 ] + -[ 111 ] 

15 o 
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because the homogeneous shear necessary to twin the 
structure is l/\/2 in a (11 1) on a (112) and this 
shear can be produced by a displacement a/6[l 1 1] on 
every successive (112) plane. It is therefore believed 
that twinning takes place by the movement of partial 
dislocations. However, it is generally recognized that 
the stacking fault energy is very high in bcc metals so 
that dissociation must be limited. Moreover, because 
the Burgers vectors of the partial dislocations are 
parallel, it is not possible to separate the partials by 
an applied stress unless one of them is anchored by 
some obstacle in the crystal. 

When the dislocation line lies along the [111] direc- 
tion it is capable of dissociating in any of the three 
{112} planes, i.e. (112), (121) and (21 1), which 
intersect along [1 1 1]. Furthermore, the a/2[l 1 1] 
screw dislocation could dissociate according to 

a - a - a - a - 

[ 111 ]^ -[ 111 ] + -[ 111 ] + -[ 111 ] 

Z ODD 

to form the symmetrical fault shown in Figure 4.51. 

The symmetrical configuration may be unstable, and 
the equilibrium configuration is one partial dislocation 
at the intersection of two {112} planes and the other 
two lying equidistant, one in each of the other two 
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Figure 4.51 Dissociated a/2 [111] dislocation in the bcc 
lattice (after Mitchell, Foxall and Hirsch, 1963; courtesy of 
Taylor and Francis ). 


planes. At larger stresses this unsymmetrical config- 
uration can be broken up and the partial dislocations 
induced to move on three neighbouring parallel planes, 
to produce a three-layer twin. In recent years an asym- 
metry of slip has been confirmed in many bcc single 
crystals, i.e. the preferred slip plane may differ in ten- 
sion and compression. A yield stress asymmetry has 
also been noted and has been related to asymmetric 
glide resistance of screw dislocations arising from their 
‘core’ structure. 

An alternative dissociation of the slip dislocation 
proposed by Cottrell is 

a a a — 

-[ 111 ]^ -[ 112 ] + -[ 111 ] 

ZD O 

The dissociation results in a twinning dislocation 
a/ 6[1 1 1] lying in the (1 1 2) plane and a a/3[l 1 2] 
partial dislocation with Burgers vector normal to the 
twin fault and hence is sessile. There is no reduction 
in energy by this reaction and is therefore not likely 
to occur except under favourable stress conditions. 

Another unit dislocation can exist in the bcc struc- 
ture, namely a[ 001], but it will normally be immobile. 
This dislocation can form at the intersection of normal 
slip bands by the reaction. 

|[TT 1] + |[1 1 1] ^ ^[001] 

with a reduction of strain energy from 3a 2 /2 to a 2 . 
The new a [001] dislocation lies in the (001) plane 
and is pure edge in character and may be considered as 
a wedge, one lattice constant thick, inserted between 
the (001) and hence has been considered as a crack 
nucleus. a[ 001] dislocations can also form in networks 
of a/2(l 1 1) type dislocations. 

4.6.5 Dislocations and stacking faults in 
ordered structures 

When the alloy orders, a unit dislocation in a disor- 
dered alloy becomes a partial-dislocation in the super- 
lattice with its attached anti-phase boundary interface, 
as shown in Figure 4.52a. Thus, when this dislocation 
moves through the lattice it will completely destroy 
the order across its slip plane. However, in an ordered 
alloy, any given atom prefers to have unlike atoms as 
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Figure 4.52 Dislocations in ordered structures. 
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its neighbours, and consequently such a process of slip 
would require a very high stress. To move a dislocation 
against the force y exerted on it by the fault requires 
a shear stress r = y/b, where b is the Burgers vec- 
tor; in /3-brass where y is about 0.07 N/m this stress 
is 300 MN/m 2 . In practice the critical shear stress of 
/3-brass is an order of magnitude less than this value, 
and thus one must conclude that slip occurs by an 
easier process than the movement of unit dislocations. 
In consequence, by analogy with the slip process in 
fee crystals, where the leading partial dislocation of 
an extended dislocation trails a stacking fault, it is 
believed that the dislocations which cause slip in an 
ordered lattice are not single dislocations but coupled 
pairs of dislocations, as shown in Figure 4.52b. The 
first dislocation of the pair, on moving across the slip 
plane, destroys the order and the second half of the 
couple completely restores it again, the third disloca- 
tion destroys it once more, and so on. In /3-brass 1 and 
similar weakly-ordered alloys such as AgMg and FeCo 
the crystal structure is ordered bcc (or CsCl-type) and, 
consequently, deformation is believed to occur by the 

1 Chapter 3, Figure 3.40, shows the CsCl or L 2 O structure. 
When disordered, the slip vector is a/2[l 1 1], but this 
vector in the ordered structure moves an A atom to a B site. 
The slip vector to move an A atom to an A site in twice the 
length and equal to a[ 1 1 1]. 


movement of coupled pairs of a/2[l 1 l]-type disloca- 
tions. The combined slip vector of the coupled pair of 
dislocations, sometimes called a super-dislocation, is 
then equivalent to a[l 1 1], and, since this vector con- 
nects like atoms in the structure, long-range order will 
be maintained. 

The separation of the super-partial dislocations may 
be calculated, as for Shockley partials, by equating the 
repulsive force between the two like a/ 2(1 1 1) disloca- 
tions to the surface tension of the anti-phase boundary. 
The values obtained for /3-brass and FeCo are about 70 
and 50 nm, respectively, and thus super-dislocations 
can be detected in the electron microscope using the 
weak beam technique (see Chapter 5). The separation 
is inversely proportional to the square of the ordering 
parameter and super-dislocation pairs ~12.5 nm width 
have been observed more readily in partly ordered 
FeCo (S = 0.59). 

In alloys with high ordering energies the antiphase 
boundaries associated with super-dislocations cannot 
be tolerated and dislocations with a Burgers vector 
equal to the unit lattice vector a (l 00) operate to pro- 
duce slip in (100) directions. The extreme case of this 
is in ionic-bonded crystals such as CsBr, but strongly- 
ordered intermetallic compounds such as NiAl are also 
observed to slip in the (100) direction with disloca- 
tions having b — a( 100). 

Ordered A 3 B-type alloys also give rise to super- 
dislocations. Figure 4.53a illustrates three (111) 



Figure 4.53 (a) Stacking of (1 1 1) planes of the LI 2 structure, illustrating the apb and fault vectors, and (b) schematic 
representation of super-dislocation structure. 
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layers of the Ll 2 structure, with different size atoms 
for each layer. The three vectors shown give rise 
to the formation of different planar faults; a/ 2 [1 01] 
is a super-partial producing apb, a/6[2 1 1] produces 
the familiar stacking fault, and a/ 3[1 12] produces a 
super-lattice intrinsic stacking fault (SISF). A [101] 
super-dislocation can therefore be composed of either 

[TO 1] ^[T0 1] + apb on (1 1 l) + "|l OIJ 

or 

[TO 1] -> ^[TT2] + SISF on (1 1 1)+ ^[21 1] 

Each of the a/ 2[101] super-partials may also dis- 
sociate, as for fee, according to 

a - a - a — 

-[ 101 ] ^ -[2 11 ] + -[11 2 ]. 

2 6 6 

The resultant super-dislocation is schematically 
shown in Figure 4.53b. In alloys such as Cu 3 Au, 
Ni 3 Mn, Ni 3 Al, etc., the stacking fault ribbon is 
too small to be observed experimentally but super- 
dislocations have been observed. It is evident, 
however, that the cross-slip of these super-dislocations 
will be an extremely difficult process. This can lead to 
a high work-hardening rate in these alloys, as discussed 
in Chapter 7. 

In an alloy possessing short-range order, slip will 
not occur by the motion of super-dislocations since 
there are no long-range faults to couple the dislocations 


together in pairs. However, because the distribution 
of neighbouring atoms is not random the passage 
of a dislocation will destroy the short-range order 
between the atoms, across the slip plane. As before, 
the stress to do this will be large but in this case there 
is no mechanism, such as coupling two dislocations 
together, to make the process easier. The fact that, 
for instance, a crystal of AuCu 3 in the quenched 
state (short-range order) has nearly double the yield 
strength of the annealed state (long-range order) may 
be explained on this basis. The maximum strength is 
exhibited by a partially-ordered alloy with a critical 
domain size of about 6 nm. The transition from 
deformation by unit dislocations in the disordered state 
to deformation by super-dislocations in the ordered 
condition gives rise to a peak in the flow stress with 
change in degree of order (see Chapter 6). 

4.6.6 Dislocations and stacking faults in 
ceramics 

At room temperature, the primary slip system in the 
fee structure of magnesia, MgO, is {1 1 0} (1 10). It 
is favoured because its Burgers vector is short and, 
most importantly, because this vector is parallel to 
rows of ions of like electrostatic charge, permitting 
the applied stress to shear the {1 1 0} planes past each 
other. Slip in the (10 0) directions is resisted at room 
temperature because it involves forcing ions of like 
charge into close proximity. If we consider the slip 
geometry of ionic crystals in terms of the Thompson 
tetrahedron for cubic ionic crystals (Figure 4.54), six 



Figure 4.54 Thompson tetrahedron for ionic crystals (cubic). 
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primary {110} slip planes extend from the central 
point O of the tetrahedron to its {1 1 0) edges. There is 
no dissociation in the {111} faces and slip is only 
possible along the (110) edges of the tetrahedron. 
Thus, for each of the {11 0} planes, there is only one 
(110) direction available. This limiting ‘one-to-one’ 
relation for a cubic ionic crystal contrasts with the 
‘three-to-one’ relation of the {1 1 1}(1 10) slip system 
in cubic metallic crystals. As an alternative to the direct 

‘easy’ translation AC, we might postulate the route 
AB + BC at room temperature. This process involves 
slip on plane OAB in the direction AB followed by slip 

on plane OCB in the direction BC. It is not favoured 
because, apart from being unfavoured in terms of 
energy, it involves a 60° change in slip direction and 
the critical resolved shear stress for the second stage is 
likely to be much greater than that needed to activate 
the first set of planes. The two- stage route is therefore 
a difficult one. Finally, it will be noticed that the 
central point lies at the junction of the (111) directions 
which, being in a cubic system, are perpendicular to 
the four {111} faces. One can thus appreciate why 
raising the temperature of an ionic crystal often allows 
the {1 1 0} ( 1 1 1) system to become active. 

In a single crystal of alumina, which is 
rhombohedral-hexagonal in structure and highly 
anisotropic, slip is confined to the basal planes. 
At temperatures above 900° C, the slip system is 
{000 1 } ( 1 1 20). As seen from Figure 2.18, this 
resultant slip direction is not one of close-packing. 
If a unit translation of shear is to take place in 


a (1120)-type direction, the movement and re- 
registration of oxygen anions and aluminium cations 
must be in synchronism (‘synchro- shear’). Figure 4.55 
shows the Burgers vectors for slip in the [112 0] 
direction in terms of the two modes of dissociation 
proposed by M. Kronberg. These two routes are 
energetically favoured. The dissociation reaction 
for the oxygen anions is: |[1 1 20] -> |[1 OlO] + 
|[0 1 10]. The vectors for these two half-partials lie in 
close-packed directions and enclose a stacking fault. 
In the case of the smaller aluminium cations, further 
dissociation of each of similar half-partials takes 
place (e.g. }[10l0] |[2ll 0] + ±[1 120]). These 

quarter-partials enclose three regions in which stacking 
is faulted. Slip involves a synchronized movement of 
both types of planar fault (single and triple) across the 
basal planes. 

4.6.7 Defects in crystalline polymers 

Crystalline regions in polymers are based upon long- 
chain molecules and are usually associated with at 
least some glassy (amorphous) regions. Although 
less intensively studied than defect structures in 
metals and ceramics, similar crystal defects, such 
as vacancies, interstitials and dislocations, have been 
observed in polymers. Their association with linear 
macromolecules, however, introduces certain special 
features. For instance, the chain ends of molecules 
can be regarded as point defects because they differ in 
chemical character from the chain proper. Vacancies, 
usually associated with chain ends, and foreign atoms, 
acting as interstitials, are also present. Edge and 
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Figure 4.55 Dissociation and synchronized shear in basal planes of alumina. 
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screw dislocations have been detected. 1 Moire pattern 
techniques of electron microscopy are useful for 
revealing the presence of edge dislocations. Growth 
spirals, centred on screw dislocations, have frequently 
been observed on surfaces of crystalline polymers, 
with the Burgers vector, dislocation axis and chain 
directions lying in parallel directions (e.g. polyethylene 
crystals grown from a concentrated melt). 

Within crystalline regions, such as spherulites com- 
posed of folded chain molecules, discrepancies in fold- 
ing may be regarded as defects. The nature of the 
two-dimensional surface at the faces of spherulites 
where chains emerge, fold and re-enter the crystalline 
region is of particular interest. Similarly, the surfaces 
where spherulite edges impinge upon each other can 
be regarded as planar defects, being analogous to grain 
boundary surfaces. 

X-ray diffraction studies of line-broadening effects 
and transmission electron microscopy have been used 
to elucidate crystal defects in polymers. In the latter 
case, the high energy of an electron beam can damage 
the polymer crystals and introduce artefacts. It is 
recognized that the special structural features found 
in polymer crystals such as the comparative thinness 
of many crystals, chain-folding, the tendency of the 
molecules to resist bending of bonds and the great 
difference between primary intramolecular bonding 
and secondary intermolecular bonding, make them 
unique and very different to metallic and ceramic 
crystals. 

4.6.8 Defects in glasses 

It is recognized that real glass structures are less homo- 
geneous than the random network model might sug- 
gest. Adjacent glassy regions can differ abruptly in 
composition, giving rise to ‘cords’, and it has been pro- 
posed that extremely small micro-crystalline regions 
may exist within the glass matrix. Tinting of clear 
glass is evidence for the presence of trace amounts of 
impurity atoms (iron, chromium) dispersed throughout 
the structure. Modifying ions of sodium are relatively 
loosely held in the interstices and have been known to 
migrate through the structure and aggregate close to 
free surfaces. On a coarser scale, it is possible for bub- 
bles (‘seeds’), rounded by surface tension, to persist 
from melting/fining operations. Bubbles may contain 
gases, such as carbon dioxide, sulphur dioxide and sul- 
phur trioxide. Solid inclusions (‘stones’) of crystalline 
matter, such as silica, alumina and silicates, may be 
present in the glass as a result of incomplete fusion, 
interaction with refractory furnace linings and localized 
crystallization (devitrification) during the final cooling. 

transmission electron microscopy of the organic 
compound platinum phthalocyanine which has relatively 
large intermolecular spacing provided the first visual 
evidence for the existence of edge dislocations. 


4.7 Stability of defects 

4.7.1 Dislocation loops 

During annealing, defects such as dislocation loops, 
stacking-fault tetrahedra and voids may exhibit 
shrinkage in size. This may be strikingly demonstrated 
by observing a heated specimen in the microscope. On 
heating, the dislocation loops and voids act as vacancy 
sources and shrink, and hence the defects annihilate 
themselves. This process occurs in the temperature 
range where self-diffusion is rapid, and confirms that 
the removal of the residual resistivity associated with 
Stage II is due to the dispersal of the loops, voids, etc. 

The driving force for the emission of vacancies from 
a vacancy defect arises in the case of (1) a prismatic 
loop from the line tension of the dislocation, (2) a 
Frank loop from the force due to the stacking fault 
on the dislocation line since in intermediate and high 
y-metals this force far outweighs the line tension con- 
tribution, and (3) a void from the surface energy y s . 
The annealing of Frank loops and voids in quenched 
aluminium is shown in Figures 4.56 and 4.58, respec- 
tively. In a thin metal foil the rate of annealing is 
generally controlled by the rate of diffusion of vacan- 
cies away from the defect to any nearby sinks, usually 
the foil surfaces, rather than the emission of vacan- 
cies at the defect itself. To derive the rate equation 
governing the annealing, the vacancy concentration 
at the surface of the defect is used as one bound- 
ary condition of a diffusion-controlled problem and 
the second boundary condition is obtained by assum- 
ing that the surfaces of a thin foil act as ideal sinks 
for vacancies. The rate then depends on the vacancy 
concentration gradient developed between the defect, 
where the vacancy concentration is given by 

c = Co exp {(dF/dn)/kT) (4.14) 

with (dF /dn) the change of free energy of the defect 
configuration per vacancy emitted at the temperature 
T, and the foil surface where the concentration is the 
equilibrium value c 0 . 

For a single, intrinsically-faulted circular dislocation 
loop of radius r the total energy of the defect F is given 
by the sum of the line energy and the fault energy, i.e. 

F ~ 2nr{[iib 2 — v)] In (r/r 0 )} + n r 2 y 

In the case of a large loop (r > 50 nm) in a mate- 
rial of intermediate or high stacking fault energy 
(y > 60 mJ/m 2 ) the term involving the dislocation line 
energy is negligible compared with the stacking fault 
energy term and thus, since (dF /dn) = (dF /dr) x 
(dr/dn), is given simply by yB 2 , where B 2 is the 
cross-sectional area of a vacancy in the (111) plane. 
For large loops the diffusion geometry approximates 
to cylindrical diffusion 2 and a solution of the time- 
independent diffusion equation gives for the anneal- 
ing rate, 

2 For spherical diffusion geometry the pre-exponential 
constant is D/b. 
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Figure 4.56 Climb of faulted loops in aluminium at 140° C. (a) t = 0 min, (b) t = 12 min, (c) t = 24 min, (d) t = 30 min 
(after Dobson, Goodhew and Smallman, 1967; courtesy of Taylor and Francis). 



Figure 4.57 Variation of loop radius with time of annealing 
for Frank dislocations in Al showing the deviation from 
linearity at small r. 


dr /At = —[2nD/b In (L/b)][exp (yB 2 /kT) - 1] 

= const, [exp (yB 2 /kT) — 1] (4.15) 

where D = D 0 exp (—U^/kT) is the coefficient of self- 
diffusion and L is half the foil thickness. The annealing 
rate of a prismatic dislocation loop can be similarly 
determined, in this case dF/dr is determined solely 
by the line energy, and then 

dr/dt = -[InD/bln ( L/b)\(ab/r ) 

= const [ab/r] (4.16) 



Figure 4.58 Sequence of micrographs showing the 
shrinkage of voids in quenched aluminium during isothermal 
annealing at 170° C. (a) t = 3 min, (b) t = 8 min, 

(c) t = 21 min, (d) t = 46 min, (e) t = 98 min. In all 
micrographs the scale corresponds to 0.1 ]im ( after 
Westmacott, Smallman and Dobson, 1968, 117; courtesy of 
the Institute of Metals). 


where the term containing the dislocation line energy 
can be approximated to otb/r. The annealing of Frank 
loops obeys the linear relation given by equation (4.15) 
at large r (Figure 4.57); at small r the curve devi- 
ates from linearity because the line tension term can 
no longer be neglected and also because the diffu- 
sion geometry changes from cylindrical to spherical 
symmetry. The annealing of prismatic loops is much 
slower, because only the line tension term is involved, 
and obeys an r 2 versus t relationship. 

In principle, equation (4.15) affords a direct deter- 
mination of the stacking fault energy y by substitution, 
but since U D is usually much bigger than yB 2 this 
method is unduly sensitive to small errors in U D . This 
difficulty may be eliminated, however, by a compar- 
ative method in which the annealing rate of a faulted 
loop is compared to that of a prismatic one at the 
same temperature. The intrinsic stacking fault energy 
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of aluminium has been shown to be 135 mJ/m 2 by this 
technique. 

In addition to prismatic and single-faulted (Frank) 
dislocation loops, double-faulted loops have also been 
annealed in a number of quenched fee metals. It is 
observed that on annealing, the intrinsic loop first 
shrinks until it meets the inner, extrinsically-faulted 
region, following which the two loops shrink together 
as one extrinsically-faulted loop. The rate of annealing 
of this extrinsic fault may be derived in a way similar 
to equation (4.15) and is given by 

dr/dt = -[nD/bln (L/b)][ex p (y E B 2 /kT) - 1] 

= const, {exp (y E S 2 /2k7’) — 1} (4.17) 

from which the extrinsic stacking-fault energy may be 
determined. Generally ye is about 10-30% higher in 
value than the intrinsic energy y 

Loop growth can occur when the direction of the 
vacancy flux is towards the loop rather than away from 
it, as in the case of loop shrinkage. This condition 
can arise when the foil surface becomes a vacancy 
source, as, for example, during the growth of a surface 
oxide film. Loop growth is thus commonly found in 
Zn, Mg, Cd, although loop shrinkage is occasionally 
observed, presumably due to the formation of local 
cracks in the oxide film at which vacancies can be 
annihilated. Figure 4.47 shows loops growing in Mg 
as a result of the vacancy supersaturation produced by 
oxidation. For the double loops, it is observed that a 
stacking fault is created by vacancy absorption at the 
growing outer perimeter of the loop and is destroyed 
at the growing inner perfect loop. The perfect regions 
expand faster than the outer stacking fault, since the 
addition of a vacancy to the inner loop decreases the 
energy of the defect by yB 2 whereas the addition of a 
vacancy to the outer loop increases the energy by the 
same amount. This effect is further enhanced as the 
two loops approach each other due to vacancy transfer 
from the outer to inner loops. Eventually the two loops 
coalesce to give a perfect prismatic loop of Burgers 
vector c = [0001] which continues to grow under the 
vacancy supersaturation. The outer loop growth rate is 
thus given by 

r 0 = -[2 nD/B In (L/b)][(c s /c 0 ) - exp (yB 2 /kT)] 

(4.18) 

when the vacancy supersaturation term (c s /c a ) is larger 
than the elastic force term tending to shrink the loop. 
The inner loop growth rate is 

r, = — [2 jiD/B In (L/b)][(c s /c 0 ) - exp (-yB 2 /kT)] 

(4.19) 

where exp (— yB 2 /kT) <5C 1, and the resultant prismatic 
loop growth rate is 

r p = -[nD/B\n(L/b)]{(c s /c 0 ) - [(ab/r) + 1]} 

(4.20) 

where (ab/r) < 1 and can be neglected. By measuring 
these three growth rates, values for y, (c s /c a ) and 


D may be determined; Mg has been shown to have 
y = 125 mJ/m 2 from such measurements. 

4.7.2 Voids 

Voids will sinter on annealing at a temperature where 
self-diffusion is appreciable. The driving force for 
sintering arises from the reduction in surface energy 
as the emission of vacancies takes place from the 
void surface. In a thin metal foil the rate of annealing 
is generally controlled by the rate of diffusion of 
vacancies away from the defect to any nearby sinks, 
usually the foil surfaces. The rate then depends on the 
vacancy concentration gradient developed between the 
defect (where the vacancy concentration is given by 

c = Co exp {(dF/dn)/kT} (4.21) 

with (dF/dn) the change in free energy of the defect 
configuration per vacancy emitted at the temperature 
T) and the foil surface where the concentration is the 
equilibrium value c Q . 

For a void in equilibrium with its surroundings 
the free energy F — An r 2 y, and since (dF/dn) = 
(dF /dr) (dr/ dn) = (8nry s )(Q/4nr 2 ) where £2 is the 
atomic volume and n the number of vacancies in the 
void, equation (4.14), the concentration of vacancies 
in equilibrium with the void is 

c v = c 0 exp (2y s Q/rkT) 

Assuming spherical diffusion geometry, the diffu- 
sion equation may be solved to give the rate of shink- 
age of a void as 

dr/dt = — (D/r){exp (ISlyJrkT) - 1} (4.22) 

For large r(>50 nm) the exponential term can be 
approximated to the first two terms of the series 
expansion and equation (4.22) may then be integrated 
to give 

r 3 = r? - (6DQ.yJkT)t (4.23) 

where r x is the initial void radius at t = 0. By observing 
the shrinkage of voids as a function of annealing time 
at a given temperature (see Figure 4.58) it is possible 
to obtain either the diffusivity D or the surface energy 
y s . From such observations, y s for aluminium is shown 
to be 1.14 J/m 2 in the temperature range 150-200°C, 
and D = 0.176 x exp (—1.31 eV/kT). It is difficult 
to determine y s for A1 by zero creep measurements 
because of the oxide. This method of obtaining y s 
has been applied to other metals and is particularly 
useful since it gives a value of y s in the self-diffusion 
temperature range rather than near the melting point. 

4.7.3 Nuclear irradiation effects 

4.7.3. 1 Behaviour of point defects and 
dislocation loops 

Electron microscopy of irradiated metals shows that 
large numbers of small point defect clusters are formed 
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Figure 4.59 A thin film of copper after bombardment with 
1 A x 10 21 a-particles m~ 2 . (a) Dislocation loops (~40 nm 
dia) and small centres of strain (~4 nm dia); (b) after a 
2-hour anneal at 350° C showing large prismatic loops 
(after Barnes and Mazey, 1960). 


on a finer scale than in quenched metals, because 
of the high supersaturation and low diffusion dis- 
tance. Bombardment of copper foils with 1.4 x 10 21 
38 MeV a-particles m“ 2 produces about 10 21 m -3 dis- 
location loops as shown in Figure 4.59a; a denuded 
region 0.8 p m wide can also be seen at the grain 
boundary. These loops, about 40 nm diameter, indi- 
cate that an atomic concentration of ~ 1.5 x 10~ 4 point 
defects have precipitated in this form. Heavier doses 
of a-particle bombardment produce larger diameter 


loops, which eventually appear as dislocation tangles. 
Neutron bombardment produces similar effects to a- 
particle bombardment, but unless the dose is greater 
than 10 21 neutrons/m 2 the loops are difficult to resolve. 
In copper irradiated at pile temperature the density 
of loops increases with dose and can be as high as 
10 14 m -2 in heavily bombarded metals. 

The micrographs from irradiated metals reveal, in 
addition to the dislocation loops, numerous small cen- 
tres of strain in the form of black dots somewhat less 
than 5 nm diameter, which are difficult to resolve (see 
Figure 4.59a). Because the two kinds of clusters differ 
in size and distribution, and also in their behaviour on 
annealing, it is reasonable to attribute the presence of 
one type of defect, i.e. the large loops, to the aggrega- 
tion of interstitials and the other, i.e. the small dots, to 
the aggregation of vacancies. This general conclusion 
has been confirmed by detailed contrast analysis of the 
defects. 

The addition of an extra (111) plane in a crys- 
tal with fee structure (see Figure 4.60) introduces two 
faults in the stacking sequence and not one, as is the 
case when a plane of atoms is removed. In conse- 
quence, to eliminate the fault it is necessary for two 
partial dislocations to slip across the loop, one above 
the layer and one below, according to a reaction of 
the form 

a a - a - a — 

« [1 1 1] + - [112] + -[121] -> -[Oil] 

3 6 6 2 

The resultant dislocation loop formed is identical 
to the prismatic loop produced by a vacancy cluster 
but has a Burgers vector of opposite sign. The size 
of the loops formed from interstitials increases with 
the irradiation dose and temperature, which suggests 
that small interstitial clusters initially form and sub- 
sequently grow by a diffusion process. In contrast, 
the vacancy clusters are much more numerous, and 
although their size increases slightly with dose, their 
number is approximately proportional to the dose and 
equal to the number of primary collisions which occur. 
This observation supports the suggestion that vacancy 
clusters are formed by the redistribution of vacancies 
created in the cascade. 

Changing the type of irradiation from electron, to 
light charged particles such as protons, to heavy ions 
such as self-ions, to neutrons, results in a progres- 
sive increase in the mean recoil energy. This results 
in an increasingly non-uniform point defect generation 
due to the production of displacement cascades by pri- 
mary knock-ons. During the creation of cascades, the 
interstitials are transported outwards (see Figure 4.7), 
most probably by focused collision sequences, i.e. 
along a close-packed row of atoms by a sequence of 
replacement collisions, to displace the last atom in this 
same crystallographic direction, leaving a vacancy-rich 
region at the centre of the cascade which can collapse 
to form vacancy loops. As the irradiation tempera- 
ture increases, vacancies can also aggregate to form 
voids. 



Defects in solids 121 



(a) 


Normal Extrinsic Intrinsic 

f.c.c fault fault 



(b) 


Figure 4.60 (a) Single (A) and double (B) dislocation loops 
in proton-irradiated copper ( x43 000). (b) Structure of a 
double-dislocation loop (after Mazey and Barnes, 1968; 
courtesy of Taylor and Francis ). 


Frank sessile dislocation loops, double-faulted 
loops, tetrahedra and voids have all been observed 
in irradiated metals, but usually under different 
irradiation conditions. Results from Cu, Ag and Au 
show that cascades collapse to form Frank loops, some 
of which dissociate towards stacking fault tetrahedra. 
The fraction of cascades collapsing to form visible 
loops, defined as the defect yield, is high, ~0.5 in 
Cu to 1.0 in Au irradiated with self-ions. Moreover, 
the fraction of vacancies taking part in the collapse 
process, expressed as the cascade efficiency, is also 
high (~0.3 to 0.5). Vacancy loops have been observed 
on irradiation at R.T. in some bcc metals (e.g. Mo, 
Nb, W, of-Fe). Generally, the loops are perfect with 
b = a/ 2(1 1 1) although they are thought to nucleate 
as a/ 2(1 10) faulted loops on {1 1 0} but unfault at an 
early stage because of the high stacking-fault energy. 
Vacancy loops have also been observed in some cph 
metals (e.g. Zr and Ti). 


Interstitial defects in the form of loops are 
commonly observed in all metals. In fee metals Frank 
loops containing extrinsic faults occur in Cu, Ag, Au, 
Ni, A1 and austenitic steels. Clustering of interstitials 
on two neighbouring (111) planes to produce an 
intrinsically faulted defect may also occur, as shown 
in Figure 4.60. In bcc metals they are predominantly 
perfect a/2(\ 11). 

The damage produced in cph metals by electron 
irradiation is very complex and for Zn and Cd (c/a > 
1.633) several types of dislocation loops, interstitial 
in nature, nucleate and grow; thus c/2 loops, i.e. 
with b = [c/2], c-loops, (c/2 + p) loops, i.e. with 
b = |(2023), [c/2] + [c/2] loops and (c/2 + p) + 
(c/2 — p) loops are all formed; in the very early 
stages of irradiation most of the loops consist of [c/2] 
dislocations, but as they grow a second loop of b = 
[c/2] forms in the centre, resulting in the formation of a 
[c/2] + [c/2] loop. The (c/2 + p) + (c/2 — p) loops 
form either from the nucleation of a (c/2 + p) loop 
inside a (c/2 — p) loop or when a [c/2] + [c/2] loop 
shears. At low dose rates and low temperatures many 
of the loops facet along (1 12 0) directions. 

In magnesium with c/ a almost ideal the nature of 
the loops is very sensitive to impurities, and interstitial 
loops with either b = |(1 120) on non-basal planes 
or basal loops with b = (c/2 + p) have been observed 
in samples with different purity. Double loops with 
b = (c/2 + p) + (c/2 — p) also form but no c/2-loops 
have been observed. 

In Zr and Ti (c/a < 1.633) irradiated with either 
electrons or neutrons both vacancy and interstitial 
loops form on non-basal planes with b = ^ ( 1 120). 
Loops with a c-component, namely b = ^( 1123) on 
{1010} planes and b = c/2 on basal planes have also 
been observed; voids also form in the temperature 
range 0.3-0.6T m . The fact that vacancy loops are 
formed on electron irradiation indicates that cascades 
are not essential for the formation of vacancy loops. 
Several factors can give rise to the increased stability 
of vacancy loops in these metals. One factor is the pos- 
sibility of stresses arising from oxidation or anisotropic 
thermal expansion, i.e. interstitial loops are favoured 
perpendicular to a tensile axis and vacancy loops par- 
allel. A second possibility is impurities segregating to 
dislocations and reducing the interstitial bias. 

4. 7. 3. 2 Radiation growth and swelling 
In non-cubic materials, partitioning of the loops on 
to specific habit planes can lead to an anisotropic 
dimensional change, known as irradiation growth. The 
aggregation of vacancies into a disc- shaped cavity 
which collapses to form a dislocation loop will give 
rise to a contraction of the material in the direction of 
the Burgers vector. Conversely, the precipitation of a 
plane of interstitials will result in the growth of the 
material. Such behaviour could account for the growth 
which takes place in o'-uranium single crystals during 
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neutron irradiation, since electron micrographs from 
thin films of irradiated uranium show the presence of 
clusters of point defects. 

The energy of a fission fragment is extremely high 
(~200 MeV) so that a high concentration of both 
vacancies and interstitials might be expected. A dose 
of 10 24 n m -2 at room temperature causes uranium to 
grow about 30% in the [0 1 0] direction and contract 
in the [100] direction. However, a similar dose at 
the temperature of liquid nitrogen produces ten times 
this growth, which suggests the preservation of about 
10 4 interstitials in clusters for each fission event that 
occurs. Growth also occurs in textured poly crystalline 
o'-uranium and to avoid the problem a random texture 
has to be produced during fabrication. Similar effects 
can be produced in graphite. 

During irradiation vacancies may aggregate to form 
voids and the interstitials form dislocation loops. The 
voids can grow by acquiring vacancies which can be 
provided by the climb of the dislocation loops. How- 
ever, because these loops are formed from interstitial 
atoms they grow, not shrink, during the climb process 
and eventually become a tangled dislocation network. 

Interstitial point defects have two properties 
important in both interstitial loop and void growth. 
First, the elastic size interaction (see Chapter 7) causes 
dislocations to attract interstitials more strongly than 
vacancies and secondly, the formation energy of an 
interstitial E\ is greater than that of a vacancy E} 
so that the dominant process at elevated temperatures 
is vacancy emission. The importance of these factors 
to loop stability is shown by the spherical diffusion- 
controlled rate equation 

dr 1 f 

, T ) DyCy Z[D[C{ 

d t b I 


-D y c 0 exp 


(F e i + y)b 2 


k T 


(4.24) 


For the growth of voids during irradiation the spherical 
diffusion equation 


dr 

dt 


[1 + (pr) 1 / 2 ]D v Cy - [1 + (Z i pr) 1 / 2 ]AC 
■[(2 y s /r) — ™ 


r |-[1 + (pr) 1/2 ]D v co exp 


kr 


has been developed, where c\ and c v are the interstitial 
and vacancy concentrations, respectively, D y and D x 
their diffusivities, y s the surface energy and Z\ is a 
bias term defining the preferred attraction of the loops 
for interstitials. 

At low temperatures, voids undergo bias-driven 
growth in the presence of biased sinks, i.e. dislocation 
loops or network of density p. At higher tempera- 
tures when the thermal emission of vacancies becomes 
important, whether voids grow or shrink depends on 
the sign of [(2y s /r) — P]. During neutron irradiation 
when gas is being created continuously the gas pres- 
sure P > 2 y s /r and a flux of gas atoms can arrive at 
the voids causing gas-driven growth. 



Figure 4.61 Plots of void swelling versus irradiation 
temperature for 1050° C solution-treated Type 316 irradiated 
with 1 MeV electrons and 46 MeV Ni 6+ to a dose of 40 dpa 
( after Nelson and Hudson, p. 19). 


The formation of voids leads to the phenomenon 
of void swelling and is of practical importance in 
the dimensional stability of reactor core components. 
The curves of Figure 4.61 show the variation in total 
void volume as a function of temperature for solution- 
treated Type 316 stainless steel; the upper cut-off 
arises when the thermal vacancy emission from the 
voids exceeds the net flow into them. Comparing the 
ion- and electron-irradiated curves shows that increas- 
ing the recoil energy moves the lower threshold to 
higher temperatures and is considered to arise from 
the removal of vacancies by the formation of vacancy 
loops in cascades; cascades are not created by electron 
irradiation. 

Voids are formed in an intermediate temperature 
range ~0.3 to 0.6T m , above that for long-range single 
vacancy migration and below that for thermal vacancy 
emission from voids. To create the excess vacancy 
concentration it is also necessary to build up a critical 
dislocation density from loop growth to bias the 
interstitial flow. The sink strength of the dislocations, 
i.e. the effectiveness of annihilating point defects, is 
given by K 2 = Zip for interstitials and K 2 = Z vp for 
vacancies where (Zj — Z v ) is the dislocation bias for 
interstitials ~10% and p is the dislocation density. As 
voids form they also act as sinks, and are considered 
neutral to vacancies and interstitials, so that K\ — 
K 2 = 47rr v C v , where r v and C v are the void radius 
and concentration, respectively. 

The rate theory of void swelling takes all these fac- 
tors into account and (1) for moderate dislocation den- 
sities as the dislocation structure is evolving, swelling 
is predicted to increase linearly with irradiation dose, 
(2) when p reaches a quasi- steady state the rate should 
increase as (dose) 3/2 , and (3) when the void density is 
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very high, i.e. the sink strength of the voids is greater 
than the sink strength of the dislocations ( K 2 ^ K\), 
the rate of swelling should again decrease. Results 
from electron irradiation of stainless steel show that the 
swelling rate is linear with dose up to 40 dpa (displace- 
ment per atom) and there is no tendency to a (dose) 3/2 
law, which is consistent with dislocation structure con- 
tinuing to evolve over the dose and temperature range 
examined. 

In the fuel element itself, fission gas swelling can 
occur since uranium produces one atom of gas (Kr 
and Ze) for every five U atoms destroyed. This leads 
to ~2 m 3 of gas (stp) per m 3 of U after a ‘burnup’ of 
only 0.3% of the U atoms. 

In practice, it is necessary to keep the swelling 
small and also to prevent nucleation at grain bound- 
aries when embrittlement can result. In general, 
variables which can affect void swelling include alloy- 
ing elements together with specific impurities, and 
micro structural features such as precipitates, grain size 
and dislocation density. In ferritic steels, the intersti- 
tial solutes carbon and nitrogen are particularly effec- 
tive in (1) trapping the radiation-induced vacancies and 
thereby enhancing recombination with interstitials, and 
(2) interacting strongly with dislocations and therefore 
reducing the dislocation bias for preferential annihila- 
tion of interstitials, and also inhibiting the climb rate 
of dislocations. Substitutional alloying elements with 
a positive misfit such as Cr, V and Mn with an affinity 
for C or N can interact with dislocations in combi- 
nation with interstitials and are considered to have a 
greater influence than C and N alone. 

These mechanisms can operate in fee alloys with 
specific solute atoms trapping vacancies and also elas- 
tically interacting with dislocations. Indeed the inhibi- 
tion of climb has been advanced to explain the low 
swelling of Nimonic PEI 6 nickel-based alloys. In this 
case precipitates were considered to restrict dislocation 
climb. Such a mechanism of dislocation pinning is 


likely to be less effective than solute atoms since pin- 
ning will only occur at intervals along the dislocation 
line. Precipitates in the matrix which are coherent in 
nature can also aid swelling resistance by acting as 
regions of enhanced vacancy -interstitial recombina- 
tion. TEM observations on O' precipitates in Al-Cu 
alloys have confirmed that as these precipitates lose 
coherency during irradiation, the swelling resistance 
decreases. 

4.7. 3. 3 Radiation-induced segregation, diffusion 
and precipitation 

Radiation-induced segregation is the segregation under 
irradiation of different chemical species in an alloy 
towards or away from defect sinks (free surfaces, 
grain boundaries, dislocations, etc.). The segregation 
is caused by the coupling of the different types of 
atom with the defect fluxes towards the sinks. There 
are four different possible mechanisms, which fall into 
two pairs, one pair connected with size effects and the 
other with the Kirkendall effect. 1 With size effects, 
the point defects drag the solute atoms to the sinks 
because the size of the solute atoms differs from the 
other types of atom present (solvent atoms). Thus inter- 
stitials drag small solute atoms to sinks and vacan- 
cies drag large solute atoms to sinks. With Kirkendall 
effects, the faster diffusing species move in the oppo- 
site direction to the vacancy current, but in the same 
direction as the interstitial current. The former case is 
usually called the ‘inverse Kirkendall effect’, although 
it is still the Kirkendall effect, but solute atoms rather 
than the vacancies are of interest. The most impor- 
tant of these mechanisms, which are summarized in 
Figure 4.62, appear to be (1) the interstitial size effect 
mechanism- the dragging of small solute atoms to 
sinks by interstitials -and (2) the vacancy Kirkendall 

! The Kirkendall effect is discussed in Chapter 6, 

Section 6.4.2. 
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Figure 4.62 Schematic representation of radiation-induced segregation produced by interstitial and vacancy flow to defect 
sinks. 
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Figure 4.63 Variation in the degree of long-range order S for initially (a) ordered and (b) disordered CusAufor various 
irradiation temperatures as a function of irradiation time. Accelerating voltage 600 kV ( after Hameed, Loretto and Smallman, 
1982; by courtesy of Taylor and Francis). 


effect- the migration away from sinks of fast-diffusing 
atoms. 

Radiation-induced segregation is technologically 
important in fast breeder reactors, where the high 
radiation levels and high temperatures cause large 
effects. Thus, for example, in Type 316 stainless steels, 
at temperatures in the range 350-650°C (depending 
on the position in the reactor) silicon and nickel 
segregate strongly to sinks. The small silicon atoms 
are dragged there by interstitials and the slow diffusing 
nickel stays there in increasing concentration as the 
other elements diffuse away by the vacancy inverse 
Kirkendall effect. Such diffusion (1) denudes the 
matrix of void-inhibiting silicon and ( 2 ) can cause 
precipitation of brittle phases at grain boundaries, etc. 

Diffusion rates may be raised by several orders of 
magnitude because of the increased concentration of 
point defects under irradiation. Thus phases expected 
from phase diagrams may appear at temperatures 
where kinetics are far too slow under normal 
circumstances. Many precipitates of this type have 
been seen in stainless steels which have been in 
reactors. Two totally new phases have definitely been 
produced and identified in alloy systems (e.g. Pd 8 W 
and Pd 8 V) and others appear likely (e.g. Cu-Ni 
miscibility gap). 

This effect relates to the appearance of precipitates 
after irradiation and possibly arises from the two 
effects described above, i.e. segregation or enhanced 
diffusion. It is possible to distinguish between 
these two causes by post-irradiation annealing, 
when the segregation-induced precipitates disappear 
but the diffusion-induced precipitates remain, being 
equilibrium phases. 

4. 7. 3. 4 Irradiation of ordering alloys 

Ordering alloys have a particularly interesting response 

to the influence of point defects in excess of 


the eqilibrium concentration. Irradiation introduces 
point defects and their effect on the behaviour of 
ordered alloys depends on two competitive processes, 
i.e. radiation-induced ordering and radiation-induced 
disordering, which can occur simultaneously. The 
interstitials do not contribute significantly to ordering 
but the radiation-induced vacancies give rise to 
ordering by migrating through the crystal. Disordering 
is assumed to take place athermally by displacements. 
Figure 4.63 shows the influence of electron irradiation 
time and temperature on (a) initially ordered and 
(b) initially disordered CU 3 AU. The final state of the 
alloy at any irradiation temperature is independent of 
the initial condition. At 323 K, CU 3 AU is fully ordered 
on irradiation, whether it is initially ordered or not, 
but at low temperatures it becomes largely disordered 
because of the inability of the vacancies to migrate 
and develop order; the interstitials (. E l m ~ 0. 1 eV) can 
migrate at low temperatures. 
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Chapter 5 

The characterization of materials 


5.1 Tools of characterization 

Determination of the structural character of a 
material, whether massive in form or particulate, 
crystalline or glassy, is a central activity of 
materials science. The general approach adopted 
in most techniques is to probe the material with 
a beam of radiation or high-energy particles. The 
radiation is electromagnetic in character and may be 
monochromatic or polychromatic: the electromagnetic 
spectrum (Figure 5.1) conveniently indicates the wide 


choice of energy which is available. Wavelengths 
(A) range from heat, through the visible range 
(k = 700-400 nm) to penetrating X-radiation. Using 
de Broglie’s important relation k = h/mv, which 
expresses the duality of radiation frequency and 
particle momentum, it is possible to apply the idea 
of wavelength to a stream of electrons. 

The microscope, in its various forms, is the principal 
tool of the materials scientist. The magnification 
of the image produced by an electron microscope 
can be extremely high; however, on occasion, the 



Figure 5.1 The electromagnetic spectrum of radiation (from Askeland, 1990, p. 732; by permission of Chapman and Hall). 
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modest magnification produced by a light stereo- 
microscope can be sufficient to solve a problem. 
In practical terms, the microscopist attaches more 
importance to resolution than magnification; that is, 
the ability of the microscope to distinguish fine detail. 
In a given microscope, increasing the magnification 
beyond a certain limit will fail to reveal further 
structural detail; such magnification is said to be 
‘empty’. Unaided, the human eye has a resolution of 
about 0. 1 mm: the resolution of light microscopes and 
electron microscopes are, respectively, about 200 nm 
and 0.5 nm. In order to perceive or image a structural 
feature it is necessary that the wavelength of the 
probing radiation should be similar in size to that of 
the feature. In other words, and as will be enlarged 
upon later, resolution is a function of wavelength. 

In this chapter we examine the principal ways 
in which light, X-rays, electrons, and neutrons 
are used to explore the structure of metals. 
Some degree of selectivity has been unavoidable. 
Although the prime purpose of microscopy is to 
provide qualitative information on structure, many 
complementary techniques are available that provide 
quantitative data on the chemical and physical 
attributes of a material. 


5.2 Light microscopy 

5.2.1 Basic principles 

The light microscope provides two-dimensional 
representation of structure over a total magnification 
range of roughly 40 x to 1250 x. Interpretation of such 
images is a matter of skill and experience and needs 
to allow for the three-dimensional nature of features 
observed. The main components of a bench-type 


microscope are (1) an illumination system comprising 
a light source and variable apertures, (2) an objective 
lens and an ocular lens (eyepiece) mounted at the ends 
of a cylindrical body-tube, and (3) a specimen stage 
(fixed or rotatable). Metallic specimens that are to 
be examined at high magnifications are successively 
polished with 6, 1 and sometimes 0.25 ji M diamond 
grit. Examination in the as-polished condition, which is 
generally advisable, will reveal structural features such 
as shrinkage or gas porosity, cracks and inclusions of 
foreign matter. Etching with an appropriate chemical 
reagent is used to reveal the arrangement and size 
of grains, phase morphology, compositional gradients 
(coring), orientation-related etch pits and the effects 
of plastic deformation. Although actually only a 
few atomic diameters wide, grain boundaries are 
preferentially and grossly attacked by many etchants. 
In bright-held illumination, light is reflected back 
towards the objective from reflective surfaces, causing 
them to appear bright. Dark-held illumination reverses 
this effect, causing grain boundaries to appear bright. 
The degree of chemical attack is sensitive to crystal 
orientation and an etched poly crystalline aggregate will 
often display its grain structure clearly (Figure 5.2a). 
Preparation techniques for ceramics are essentially 
similar to those for metals and alloys. However, their 
porosity can cause two problems. First, there is a 
risk of entrapping diamond particles during polishing, 
making ultrasonic cleaning advisable. Second, it 
may be necessary to strengthen the structure by 
impregnating with liquid resin in vacuo, provided that 
pores are interconnected. 

The objective, the most important and critical 
component in the optical train of the light microscope, 
is made up of a number of glass lenses and, sometimes, 
huorite (CaF 2 ) lenses also. Lenses are subject to 
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Figure 5.2 (a) Reflection of light from etched specimen, (b) Use of oil to improve numerical aperture of objective. 
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spherical and chromatic aberrations. Minimization 
and correction of these undesirable physical effects, 
greatly aided by modem computational techniques, 
is possible and objectives are classified according 
to the degree of correction, i.e. achromats, fluorites 
(semi-apochromats), apochromats. Lenses are usually 
coated in order to increase light transmission. As 
magnification is increased, the depth of field of 
the objective becomes smaller, typically falling from 
250 pm at 15 x to 0.08 pm at 1200x, so that specimen 
flatness becomes more critical. The focal length and 
the working distance (separating its front lens from 
the specimen) of an objective differ. For instance, an 
/ 2 mm objective may have a working distance of 
0.15 mm. 

Resolution, rather than magnification, is usually the 
prime concern of the skilled microscopist. It is the 
smallest separating distance (5) that can be discerned 
between two lines in the image. The unaided eye, at the 
least distance of comfortable vision (about 250 mm), 
can resolve 0.1 mm. Confusingly, the resolution value 
for a lens with a so-called high resolving power is 
small. Resolution is determined by (1) the wavelength 
(A) of the radiation and (2) the numerical aperture 
(NA) of the objective and is expressed by the Abbe 
formula 8 = A./2NA. 

The numerical aperture value, which is engraved 
upon the side of the objective, indicates the light- 
gathering power of the compound lens system and 
is obtained from the relation NA = n sin a , where n 
is the refractive index of the medium between the 
front lens face of the objective and the specimen, 
and a is the semi-apex angle of the light cone 
defined by the most oblique rays collected by the 
lens. Numerical apertures range in typical value from 
0.08 to 1.25. Despite focusing difficulties and the 
need for costly lenses, efforts have been made to use 
short-wavelength ultraviolet radiation: developments 
in electron microscopy have undermined the feasibility 
of this approach. Oil-immersion objectives enable the 
refractive index term to be increased (Figure 5.2b). 
Thus, by replacing air (n = 1) with a layer of cedar 
wood oil ( n = 1.5) or monobromonaphthalene ( n = 
1.66), the number of rays of reflected light accepted 
by the front lens of the objective is increased and 
resolution and contrast are improved. The range 
of wavelengths for visible light is approximately 
400-700 nm; consequently, using the Abbe formula, 
it can readily be shown that the resolution limit 
of the light microscope is in the order of 200 nm. 
The ‘useful’ range of magnification is approximately 
500-1000 NA. The lower end of the range can be 
tiring to the eyes; at the top end, oil-immersion 
objectives are useful. 

Magnification is a subjective term; for instance, it 
varies with the distance of an image or object from 
the eye. Hence, microscopists sometimes indicate this 
difficulty by using the more readily defined term ‘scale 
of reproduction’, which is the lineal size ratio of an 
image (on a viewing screen or photomicrograph) to 


the original object. Thus, strictly speaking, a statement 
such as 500 x beneath a photomicrograph gives the 
scale of reproduction, not the magnification. 

The ocular magnifies the image formed by the 
objective: the finally-observed image is virtual. It 
can also correct for certain objective faults and, in 
photomicrography, be used to project a real image. The 
ocular cannot improve the resolution of the system but, 
if inferior in quality, can worsen it. The most widely 
used magnifications for oculars are 8x and 12.5 x. 

Two dimensional features of a standard bench 
microscope, the mechanical tube length t m and optical 
tube length t Q , are of special significance. The former 
is the fixed distance between the top of the body 
tube, on which the ocular rests, and the shoulder of 
the rotatable nosepiece into which several objectives 
are screwed. Objectives are designed for a certain 
t m value. A value of 160 mm is commonly used. 
(In Victorian times, it was 250 mm, giving a rather 
unwieldy instrument.) The optical tube length t Q is 
the distance between the front focal point of the 
ocular and the rear focal plane of the objective. 
Parfocalization, using matched parfocal objectives and 
oculars, enables the specimen to remain in focus when 
objectives are step-changed by rotating the nosepiece. 
With each change, t 0 changes but the image produced 
by the objective always forms in the fixed focal 
phase of the ocular. Thus the distance between the 
specimen and the aerial image is kept constant. Some 
manufacturers base their sequences of objective and 
ocular magnifications upon preferred numbers 1 rather 
than upon a decimal series. This device facilitates the 
selection of a basic set of lenses that is comprehensive 
and ‘useful’ (exempt from ‘empty’ magnification). For 
example, the Michel series of 6.3 x, 8x, 10 x, 12.5 x, 
16 x, 20 x, 25 x, etc., a geometrical progression with a 
common ratio of approximately 1.25, provides a basis 
for magnification values for objectives and oculars. 
This rational approach is illustrated in Figure 5.3. 
Dot -dash lines represent oculars and thin solid 
lines represent objectives. The bold lines outline a 
box within which objective/ocular combinations give 
‘useful’ magnifications. Thus, pairing of a 12.5 x 
ocular with a 40 x objective (NA = 0.65) gives a 
‘useful’ magnification of 500 x. 

5.2.2 Selected microscopical techniques 

5.2.2. 1 Phase-contrast microscopy 
Phase-contrast microscopy is a technique that enables 
special surface features to be studied even when 

'The valuable concept of preferred numbers/sizes, currently 
described in document PD 6481 of the British Standards 
Institution, was devised by a French military engineer, 
Colonel Charles Renard (1847-1905). In 1879, during the 
development of dirigible (steerable) balloons, he used a 
geometrical progression to classify cable diameters. A 
typical Renard Series is 1.25, 1.6, 2.0, 2.5, 3.2, 4.0, 5.0, 6.4, 
8.0, etc. 
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Figure 5.3 Range of ‘useful’ magnification in light 
microscope ( from Optical Systems for the Microscope, 1967, 
p. 15; by courtesy of Carl Zeiss, Germany ). 

there is no colour or reflectivity contrast. The light 
reflected from a small depression in a metallographic 
specimen will be retarded in phase by a fraction 
of a light wavelength relative to that reflected from 
the surrounding matrix and, whereas in ordinary 


microscopy a phase difference in the light collected by 
the objective will not contribute to contrast in the final 
image, in phase-contrast microscopy small differences 
in phases are transformed into differences in brightness 
which the eye can detect. 

General uses of the technique include the 
examination of multi-phased alloys after light etching, 
the detection of the early stages of precipitation, 
and the study of cleavage faces, twins and other 
deformation characteristics. The optimum range of 
differences in surface level is about 20-50 nm, 
although under favourable conditions these limits 
may be extended. A schematic diagram of the basic 
arrangement for phase contrast in the metallurgical 
microscope is shown in Figure 5.4a. A hollow cone 
of light produced by an annulus A, is reflected by the 
specimen and brought to an image in the back focal 
plane of the objective. A phase plate of suitable size 
should, strictly, be positioned in this plane but, for the 
ease of interchangeability of phase plates, the position 
Q in front of the eyepiece E is often preferred. This 
phase plate has an annulus, formed either by etching 
or deposition, such that the light it transmits is either 
advanced or retarded by a quarter of a wavelength 
relative to the light transmitted by the rest of the 
plate and, because the light reflected from a surface 
feature is also advanced or retarded by approximately 
A./4, the beam is either in phase or approximately k/2 
or 7 r out of phase with that diffracted by the surface 
features of the specimen. Consequently, reinforcement 
or cancellation occurs, and the image intensity at any 
point depends on the phase difference produced at 
the corresponding point on the specimen surface, and 
this in turn depends upon the height of this point 
relative to the adjacent parts of the surface. When 
the light passing through the annulus is advanced in 
phase, positive phase contrast results and areas of 
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Figure 5.4 Schematic arrangement of microscope system for (a) phase-contrast and (b) polarized light microscopy. 
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the specimen which are proud of the matrix appear 
bright and depressions dark; when the phase is retarded 
negative contrast is produced and ‘pits’ appear bright 
and ‘hills’ dark. 

5. 2. 2. 2 Polarized- light microscopy 
The basic arrangement for the use of polarized light is 
shown in Figure 5.4b. The only requirements of this 
technique are that the incident light on the specimen be 
plane-polarized and that the reflected light be analysed 
by a polarizing unit in a crossed relation with respect 
to the polarizer, i.e. the plane of polarization of the 
analyser is perpendicular to that of the polarizer. 

The application of the technique depends upon the 
fact that plane-polarized light striking the surface of 
an optically isotropic metal is reflected unchanged if it 
strikes at normal incidence. If the light is not at normal 
incidence the reflected beam may still be unchanged 
but only if the angle of incidence is in, or at right 
angles to, the plane of polarization, otherwise it will 
be elliptically polarized. It follows that the unchanged 
reflected beam will be extinguished by an analyser in 
the crossed position whereas an elliptically polarized 
one cannot be fully extinguished by an analyser in 
any position. When the specimen being examined is 
optically anisotropic, the light incident normally is 
reflected with a rotation of the plane of polarization and 
as elliptically polarized light; the amount of rotation 
and of elliptical polarization is a property of the metal 
and of the crystal orientation. 

If correctly prepared, as-polished specimens of 
anisotropic metals will ‘respond’ to polarized light and 
a grain contrast effect is observed under crossed polar s 
as a variation of brightness with crystal orientation. 
Metals which have cubic structure, on the other hand, 
will appear uniformly dark under crossed polars, unless 
etched to invoke artificial anisotropy, by producing 
anisotropic surface films or well-defined pits. An etch 
pit will reflect the light at oblique incidence and 
elliptically-polarized light will be produced. However, 
because such a beam cannot be fully extinguished 
by the analyser in any position, it will produce a 
background illumination in the image which tends to 
mask the grain contrast effect. 

Clearly, one of the main uses of polarized light is to 
distinguish between areas of varying orientation, since 
these are revealed as differences of intensity under 
crossed polars. The technique is, therefore, very useful 
for studying the effects of deformation, particularly the 
production of preferred orientation, but information on 
cleavage faces, twin bands and sub-grain boundaries 
can also be obtained. If a ‘sensitive tint’ plate is 
inserted between the vertical illuminator and the 
analyser each grain of a sample may be identified by 
a characteristic colour which changes as the specimen 
is rotated on the stage. This application is useful in 
the assessment of the degree of preferred orientation 
and in recrystallization studies. Other uses of polarized 
light include distinguishing and identifying phases in 
multi-phase alloys. 


Near-perfect extinction occurs when the polars of 
a transmission microscope are crossed. If a thin 
section or slice of ceramic, mineral or rock is 
introduced and the stage slowly rotated, optically 
anisotropic crystals will produce polarization colours, 
developing maximum brilliance at 45° to any of 
the four symmetrical positions of extinction. The 
colour of a crystal depends upon its birefringence, 
or capacity for double-refraction, and thickness. By 
standardizing the thickness of the section at 30-50 pm 
and using a Michel-Levy colour chart, it is possible to 
identify crystalline species. In refractory materials, it is 
relatively easy to identify periclase (MgO), chromite 
(FeCr0 4 ), tridymite (Si0 2 ) and zircon (ZrSi0 4 ) by 
their characteristic form and colour. 

As birefringence occurs within the crystal, each 
incident ray forms ordinary and extraordinary rays 
which are polarized in different planes and travel 
through the crystal at different velocities. On leaving 
the analyser, these out-of-phase ‘fast’ and ‘slow’ 
rays combine to produce the polarization colour. 
This colour is complementary to colour cancelled by 
interference and follows Newton’s sequence: yellow, 
orange, red, violet, blue, green. More delicate, higher- 
order colours are produced as the phase difference 
between the emergent rays increases. Anisotropic 
crystals are either uniaxial or biaxial, having one or 
two optic axes, respectively, along which birefringence 
does not occur. (Optic axes do not necessarily 
correspond with crystallographic axes.) It is therefore 
possible for quartz (uniaxial) and mica (biaxial) 
crystals to appear black because of their orientation 
in the slice. Uniaxial (tetragonal and hexagonal 
systems) can be distinguished from biaxial crystals 
(orthorhombic, triclinic and monoclinic systems) by 
introducing a Bertrand lens into the light train of the 
microscope to give a convergent beam, rotating the 
stage and comparing their interference figures: uniaxial 
crystals give a moving ‘ring and brush’ pattern, biaxial 
crystals give two static ‘eyes’. Cubic crystals are 
isotropic, being highly symmetrical. Glassy phases are 
isotropic and also appear black between crossed polars; 
however, glass containing residual stresses from rapid 
cooling produces fringe patterns and polarization 
colours. The stress-anisotropic properties of plastics 
are utilized in photoelastic analyses of transparent 
models of engineering structures or components made 
from standard sheets of constant thickness and stress- 
optic coefficient (e.g. clear Bakelite , epoxy resin). 
The fringe patterns produced by monochromatic 
light and crossed polars in a polariscope reveal the 
magnitude and direction of the principal stresses 
that are developed when typical working loads are 
applied. 

5.2.2. 3 Hot-stage microscopy 
The ability to observe and photograph phase 
transformations and structural changes in metals, 
ceramics and polymers at high magnifications while 
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being heated holds an obvious attraction. Various 
designs of microfumace cell are available for mounting 
in light microscope systems. 

For studies at moderate temperatures, such 
as spherulite formation in a cooling melt of 
polypropylene, the sample can be placed on a glass 
slide, heated in a stage fitment, and viewed through 
crossed polars with transmitted light. For metals, 
which have an increasing tendency to vaporize as 
the temperature is raised, the polished sample is 
enclosed in a resistance-heated microfurnace and 
viewed by reflected light through an optically worked 
window of fused silica. The chamber can be either 
evacuated (10 -6 torr) or slowly purged with inert gas 
(argon). The later must be dry and oxygen-free. A 
Pt:Pt-10Rh thermocople is inserted in the specimen. 
The furnace should have a low thermal inertia and 
be capable of heating or cooling the specimen at 
controlled rates; temperatures of up to 1800°C are 
possible in some designs. The presence of a window, 
and possibly cooling devices, drastically reduces the 
available working distance for the objective lens, 
particularly when a large numerical aperture or high 
magnification are desired. One common solution is to 
use a Burch-type reflecting objective with an internal 
mirror system which gives a useful working distance of 
13-14 mm. The type of stage unit described has been 
used for studies of grain growth in austenite and the 
formation of bainite and martensite in steels, allotropic 
transformations, and sintering mechanisms in powder 
compacts. 

When polished polycrystalline material is heated, 
individual grains tend to reduce their volume as 
a result of surface tension and grain boundaries 
appear as black lines, an effect referred to as thermal 
etching or grooving. If a grain boundary migrates, 
as in the grain growth stage of annealing, ghost 
images of former grooves act as useful markers. As 
the melting point is approached, there is often a 
noticeable tendency for grain boundary regions to fuse 
before the bulk grains; this liquation effect is due 
to the presence of impurities and the atomic misfit 
across the grain boundary surface. When interpreting 
the visible results of hot-stage microscopy, it is 
important to bear in mind that surface effects do 
not necessarily reflect what is happening within the 
bulk material beneath the surface. The technique can 
produce artefacts; the choice between evacuation and 
gas-purging can be crucial. For instance, heating in 
vacuo can favour decarburization and grain-coarsening 
in steel specimens. 

The classic method for studying high-temperature 
phases and their equilibria in oxide systems was 
based upon rapid quenching (e.g. silicates). This 
indirect method is slow and does not always 
preserve the high-temperature phase(s). A direct 
microscopical technique uses the U-shaped notch of 
a thermocouple hot junction as the support for a 


small non-metallic sample. In the original design, 1 
the junction was alternately connected by high-speed 
relay at a frequency of 50 Hz to a power circuit and 
a temperature-measuring circuit. The sample could 
be heated to temperatures of up to 2150°C and it 
was possible to observe crystallization from a melt 
directly through crossed polars. Although unsuitable 
for metals and highly volatile materials, the technique 
has been applied to glasses, slags, refractories, Portland 
cements, etc., providing information on phase changes, 
devitrification, sintering shrinkage, grain growth and 
the ‘wetting’ of solids by melts. 

5. 2. 2. 4 Microhardness testing 
The measurement of hardness with a microscope 
attachment, comprising a diamond indentor and 
means for applying small loads, dates back more 
than 50 years. Initially used for small components 
(watch gears, thin wire, foils), microhardness testing 
was extended to research studies of individual 
phases, orientation effects in single crystals, diffusion 
gradients, ageing phenomena, etc. in metallic and 
ceramic materials. Nowadays, testing at temperatures 
up to 1000°C is possible. In Europe, the pyramidal 
Vickers-type (interfacial angle 136°) indentor, which 
produces a square impression, is generally favoured. 
Its counterpart in general engineering employs test 
loads of 5-100 kgf: in microhardness testing, typical 
test loads are in the range 1-100 gf (1 gf = 1 pond = 
1 p = 9.81 mN). A rhombic-based Knoop indentor of 
American origin has been recommended for brittle 
and/or anisotropic material (e.g. carbides, oxides, 
glass) and for thin foils and coatings where a shallow 
depth of impression is desired. The kite-shaped Knoop 
impression is elongated, with a 7:1 axial ratio. 

Microhardness tests need to be very carefully con- 
trolled and replicated, using as large a load as possible. 
The surface of the specimen should be strain-free (e.g. 
electropolished), plane and perpendicular to the inden- 
tor axis. The indentor is lowered slowly at a rate of 
<1 mm min -1 under vibration-free conditions, even- 
tually deforming the test surface in a manner analogous 
to steady- state creep. This condition is achieved within 
15 s, a test period commonly used. 

The equations for Vickers hardness (H v ) and Knoop 
hardness (H K ) take the following forms: 

H w = 1854.4(P/d 2 ) kgf mm- 2 (5.1) 

H K = 14228(P/d 2 ) kgf mm“ 2 (5.2) 

In these equations, which have the dimensions of 
stress, load P and diagonal length d are measured in 
gf and jiim, respectively. The first equation is based 
upon the surface area of the impression; the second 
is based upon its projected area and the length of the 
long diagonal. 

developed by W. Gutt and co-workers at the Building 
Research Station, Watford. 
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The main potential difficulty concerns the possi- 
ble dependence of microhardness values (H m ) upon 
test load. As the test load is reduced below a cer- 
tain threshold, the measured microhardness value may 
tend to decrease or increase, depending upon the mate- 
rial. In these circumstances, when measuring absolute 
hardness rather than relative hardness, it is useful to 
consider the material’s behaviour in terms of the Meyer 
equation which relates indenting force P to the diag- 
onal length of the Vickers-type impression produced, 
d, as follows: 

P = kd n (5.3) 

The Meyer exponent n expresses the strain-hardening 
characteristics of the material as it deforms plastically 
during the test; it increases in value with the degree 
of strain-hardening. For simple comparisons of rela- 
tive microhardness, hardness values at a fixed load can 
be compared without allowance for load-dependence. 
On the other hand, if absolute values of hardness are 
required from low-load tests, it is advisable to deter- 
mine the Meyer line for the particular material over 
a comparatively small load range by plotting P val- 
ues against the corresponding d values, using log-log 
graph paper. (Extrapolations beyond the chosen load 
range are unwise because the profile of the Meyer 
line may change.) Figure 5.5 shows the Meyer line, 
slope n, for a material giving load-dependent micro- 
hardness values. The slope n is less than 2, which 
is usual. The H m curve has a negative slope and 
microhardness values increase as the load increases. 
One way of reporting load-dependent microhardness 
results is to state three hardness numbers in terms of 
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Figure 5.5 Meyer line for material with load-independent 
hardness (by courtesy of Carl Zeiss, Germany). 


a standard set of three diagonal d-values, as shown in 
Figure 5.5. The approximate values for the set shown 
are H 5 ^ m = 160, H l0fim = 140, H 2 o^ m = 120. When 
the anisotropy ratio for elastic moduli is high, micro- 
hardness values can vary greatly from grain to grain 
in polycrystalline material. 

Combination of the Vickers equation with the Meyer 
equation gives the following expression: 

Hy = constant x d n ~ 2 (5.4) 

Accordingly, if n = 2, which is true for the conven- 
tional Vickers macrohardness test, the gradient of the 
H m line becomes zero and hardness values are conve- 
niently load-independent. 

5. 2. 2. 5 Quantitative microscopy 
Important standard methods for measuring grain size 
and contents of inclusions and phases have evolved 
in metallurgy and mineralogy. Grain size in metal- 
lic structures is commonly assessed by the ASTM 
(McQuaid-Ehn) method in which etched microstruc- 
tures are compared with diagrams or standard sets of 
photographs at a standard magnification, say lOOx. 
Numerous manual methods have been developed to 
assess the cleanliness of steels. They range from 
direct methods, in which particles beneath linearly- 
traversed crosswires are individually counted, to com- 
parative methods based on photomicrographs or charts 
(e.g. Fox, Jemkontoret (JK), SAE-ASTM, Diergarten). 
Sometimes these methods attempt to identify inclu- 
sions according to type and form. Nowadays, auto- 
matic systems are available to convert microscopical 
information, from a light or electron microscope, to 
electronic signals that can be rapidly processed and 
evaluated. Although more objective than former meth- 
ods, they must be set up correctly and their limitations 
appreciated. For instance, the plane of sectioning is 
critical when highly-directional features are present, 
such as filaments in composites and slag stringers in 
metals. Quantitative methods have made it possible 
to relate many micro structural characteristics, such as 
volume fractions, interparticle distances, grain bound- 
ary area per unit volume, interlamellar spacing, etc., 
to mechanical properties. In 1848, the French petro- 
grapher Delesse established mathematically that, in a 
fully random cross-section of a uniform aggregate, the 
area fraction of a given microconstituent (phase) in 
a field of view is equal to the corresponding volume 
fraction in the three-dimensional aggregate. There are 
three basic methods for measuring the area fraction, as 
the following exercise demonstrates. 

Suppose that a certain field of view contains a 
dark phase and a light-coloured matrix, as shown in 
Figure 5.6a. Using the systematic notation for stereol- 
ogy given in Table 5.1, the total area occupied by the 
dark phase in the test area A T (= L 2 ) is A; it is the sum 
of i areas, each of area a. This areal fraction is A a . 
Alternatively, the field of view may be systematically 
traversed with a random test line, length L T , and a 
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Figure 5.6 Comparison of methods for measuring areal fraction. 
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Table 5.1 Stereological notation (after Underwood) 


Dimensions of symbol 


Symbol 

Dimensions 


P 

Points of intersection 

mm* 

N 

Number of objects 

mm ( 

L 

Lines 

mm 

A 

Flat surfaces 

mm 

S 

Curved surfaces 

mm 

V 

Volumes 

mm 


length L derived from the sum of j intercepts, each of 
length /. The lineal fraction L l = L/L? (Figure 5.6b). 
In the third method (Figure 5.6c), a regular grid of 
points is laid over the field and all coincidences of 
grid intersections with the dark phase counted to give 
a point count fraction P/Pj which is P P . Only one field 
of view is shown in Figure 5.6; in practice, numerous 
different fields are tested in order to give statistically- 
significant average values, i.e. A a , L l , P p . The areal 
fraction method is chiefly of historical interest. The lin- 
eal method was used in the original Quantimet instru- 
ments which scanned 1 the images electronically with a 
raster. The point count method is the basis of modern 
instruments in which electronically-scanned fields are 
composed of thousands of pixels. 

Certain three-dimensional properties of a structure 
have a great influence on a material’s behaviour 
but cannot be measured directly by microscope. For 
instance, the amount of curved grain boundary surface 

1 Contrary to popular belief, the term ‘scan’ implies 

accuracy and fine resolution. 
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Figure 5.7 Relationships between stereological quantities. 


in unit volume Sy of a single-phase alloy is poten- 
tially more significant than average grain size. The 
diagram of Figure 5.7 shows some of the stereolog- 
ical equations that permit non-measurable quantities 
to be calculated. Measurable quantities are enclosed 
in circles, calculable quantities are enclosed in boxes 
and arrows show the possible routes in the triangu- 
lar matrix. Important stereological equations provid- 
ing the necessary links are shown below the matrix. 
Thus, the above-mentioned quantity Sy may be derived 
from intersections on test lines or traverses (P L ) since 
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2 P L = 5 V . Similarly, length of dislocation lines, acic- 
ular precipitates or slag stringers per unit volume L v 
may be obtained from point counts over an area, Pa- 

5.3 X-ray diffraction analysis 

5.3.1 Production and absorption of X-rays 

The use of diffraction methods is of great importance 
in the analysis of crystalline solids. Not only can 
they reveal the main features of the structure, i.e. 
the lattice parameter and type of structure, but also 
other details such as the arrangement of different kinds 
of atoms in crystals, the presence of imperfections, 
the orientation, sub-grain and grain size, the size and 
density of precipitates. 

X-rays are a form of electromagnetic radiation dif- 
fering from light waves (X = 400-800 nm) in that 
they have a shorter wavelength (X ~ 0.1 nm). These 
rays are produced when a metal target is bombarded 
with fast electrons in a vacuum tube. The radiation 
emitted, as shown in Figure 5.8a, can be separated 
into two components, a continuous spectrum which 
is spread over a wide range of wavelengths and a 
superimposed line spectrum characteristic of the metal 
being bombarded. The energy of the ‘white’ radiation, 
as the continuous spectrum is called, increases as the 
atomic number of the target and approximately as the 
square of the applied voltage, while the characteris- 
tic radiation is excited only when a certain critical 
voltage is exceeded. The characteristic radiation is pro- 
duced when the accelerated electrons have sufficient 
energy to eject one of the inner electrons (ls-level, for 
example) from its shell. The vacant ls-level is then 
occupied by one of the other electrons from a higher 
energy level, and during the transition an emission 
of X-radiation takes place. If the electron falls from 
an adjacent shell then the radiation emitted is known 
as Kol- radiation, since the vacancy in the first X-shell 
n = 1 is filled by an electron from the second L-shell 
and the wavelength can be derived from the relation 

h v = E l - E k (5.5) 

However, if the X-shell vacancy is filled by an electron 
from an M - shell (i.e. the next highest quantum shell) 
then X /3-radiation is emitted. Figure 5.8 shows that, in 
fact, one cannot be excited without the other, and the 
characteristic X-radiation emitted from a copper target 
is in detail composed of a strong Kol- doublet and a 
weaker X/3- line. 

In transversing a specimen, an X-ray beam loses 
intensity according to the equation 

I = To exp[— /zx] (5.6) 

where 7 0 and I are the values of the initial and 
final intensities respectively, /z is a constant, known 
as the linear absorption coefficient which depends 
on the wavelength of the X-rays and the nature of 
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Figure 5.8 (a) Distribution of X-ray intensity from a copper 
target and (b) dependence of mass absorption coefficient on 
X-ray wavelength for nickel. 

the absorber, and x is the thickness of the speci- 
men. 1 The variation of mass absorption coefficient, 
i.e. linear absorption coefficient divided by density, 
/z/p, with wavelength is of particular interest, as 
shown in Figure 5.8b, which is the curve for nickel. 
It varies approximately as X 3 until a critical value 
of A, (=0.148 nm) is reached, when the absorption 
decreases precipitously. The critical wavelength X K at 
which this decrease occurs is known as the K absorp- 
tion edge, and is the value at which the X-ray beam 
has acquired just sufficient energy to eject an elec- 
tron from the X-shell of the absorbing material. The 
value of X K is characteristic of the absorbing material, 
and similar L and M absorption edges occur at higher 
wavelengths. 

This sharp variation in absorption with wavelength 
has many applications in X-ray practice, but its most 
common use is in filtering out unwanted X/3-radiation. 
For example, if a thin piece of nickel foil is placed 
in a beam of X-rays from a copper target, absorption 
of some of the short wavelength ‘white’ radiation 
and most of the X /3-radiation will result, but the 

! This absorption equation is the basis of radiography, since 
a cavity, crack or similar defect will have a much lower 
/z-value than the sound metal. Such defects can be detected 
by the appearance of an intensity difference registered on a 
photographic film placed behind the X-irradiated object. 
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strong Xo'-radiation will be only slightly attenuated. 
This filtered radiation is sufficiently monochromatic 
for many X-ray techniques, but for more specialized 
studies when a pure monochromatic beam is required, 
crystal monochromators are used. The X-ray beam is 
then reflected from a crystal, such as quartz or lithium 
fluoride, which is oriented so that only the desired 
wavelength is reflected according to the Bragg law (see 
below). 

5.3.2 Diffraction of X-rays by crystals 

The phenomena of interference and diffraction are 
commonplace in the held of light. The standard school 
physics laboratory experiment is to determine the spac- 
ing of a grating, knowing the wavelength of the 
light impinging on it, by measuring the angles of the 
diffracted beam. The only conditions imposed on the 
experiment are that (1) the grating be periodic, and 
(2) the wavelength of the light is of the same order 
of magnitude as the spacing to be determined. This 
experiment immediately points to the application of 
X-rays in determining the spacing and inter-atomic dis- 
tances in crystals, since both are about 0.1 -0.4 nm in 
dimension. Rigorous consideration of diffraction from 
a crystal in terms of a three-dimensional diffraction 
grating is complex, but Bragg simplified the problem 
by showing that diffraction is equivalent to symmetri- 
cal reflection from the various crystal planes, provided 
certain conditions are fulfilled. Figure 5.9a shows a 
beam of X-rays of wavelength X, impinging at an angle 
6 on a set of crystal planes of spacing d. The beam 
reflected at the angle 6 can be real only if the rays 
from each successive plane reinforce each other. For 
this to be the case, the extra distance a ray, scattered 
from each successive plane, has to travel, i.e. the path 
difference, must be equal to an integral number of 
wavelengths, nX. For example, the second ray shown 
in Figure 5.9a has to travel further than the first ray 
by the distance PO + OQ. The condition for reflection 
and reinforcement is then given by 

nX = PO + OQ = 2 ON sin 0 = 2d sin <9 (5.7) 


This is the well-known Bragg law and the critical 
angular values of 0 for which the law is satisfied are 
known as Bragg angles. 

The directions of the reflected beams are determined 
entirely by the geometry of the lattice, which in turn in 
governed by the orientation and spacing of the crystal 
planes. If for a crystal of cubic symmetry we are given 
the size of the structure cell, a , the angles at which 
the beam is diffracted from the crystal planes ( hkl ) 
can easily be calculated from the interplanar spacing 
relationship 

d m) = a/V(h 2 +k? + l 2 ) (5.8) 

It is conventional to incorporate the order of reflection, 
n , with the Miller index, and when this is done the 
Bragg law becomes 

X = 2 a sin 0/\/ ( n 2 h 2 + n 2 k 2 + n 2 l 2 ) 

= 2asind/VN (5.9) 

where N is known as the reflection or line number. 
To illustrate this let us take as an example the second- 
order reflection from (10 0) planes. Then, since n = 2, 
h = 1, k = 0, and / = 0, this reflection is referred to 
either as the 200 reflection or as line 4. The lattice 
planes which give rise to a reflection at the smallest 
Bragg angle are those which are most widely spaced, 
i.e. those with a spacing equal to the cell edge, d i 0 o- 
The next planes in order of decreased spacing will 
be {110} planes for which duo = while the 

octahedral {111} planes will have a spacing equal to 
tf/V 3. The angle at which any of these planes in a 
crystal reflect an X-ray beam of wavelength X may be 
calculated by inserting the appropriate value of d into 
the Bragg equation. 

To ensure that Bragg’s law is satisfied and that 
reflections from various crystal planes can occur, it 
is necessary to provide a range of either 6 or X values. 
The various ways in which this can be done leads to the 
standard methods of X-ray diffraction, namely: (1) the 
Laue method, and (2) the powder method. 




Figure 5.9 (a) Diffraction from crystal planes, (b) Form of the atomic scattering curves for aluminium and zinc. 
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5.3.3 X-ray diffraction methods 

5.3.3. 1 Laue method 

In the Laue method, a stationary single crystal is 
bathed in a beam of ‘white’ radiation. Then, because 
the specimen is a fixed single crystal, the variable nec- 
essary to ensure that the Bragg law is satisfied for all 
the planes in the crystal has to be provided by the 
range of wavelengths in the beam, i.e. each set of crys- 
tal planes chooses the appropriate X from the ‘white’ 
spectrum to give a Bragg reflection. Radiation from a 
target metal having a high atomic number (e.g. tung- 
sten) is often used, but almost any form of ‘white’ 
radiation is suitable. In the experimental arrangement 
shown in Figure 5.10a, either a transmission photo- 
graph or a back-reflection photograph may be taken, 
and the pattern of spots which are produced lie on 
ellipses in the transmission case or hyperbolae in the 
back-reflection case. All spots on any ellipse or hyper- 
bola are reflections from planes of a single zone (i.e. 
where all the lattice planes are parallel to a common 
direction, the zone axis) and, consequently, the Laue 
pattern is able to indicate the symmetry of the crystal. 
For example, if the beam is directed along a [1 1 1] or 
[100] direction in the crystal, the Laue pattern will 
show three- or fourfold symmetry, respectively. The 
Laue method is used extensively for the determina- 
tion of the orientation of single crystals and, while 
charts are available to facilitate this determination, the 
method consists essentially of plotting the zones taken 
from the film on to a stereogram, and comparing the 
angles between them with a standard projection of that 
crystal structure. In recent years the use of the Laue 
technique has been extended to the study of imper- 
fections resulting from crystal growth or deformation, 
because it is found that the Laue spots from perfect 
crystals are sharp, while those from deformed crystals 
are, as shown in Figure 5.10b, elongated. This elon- 
gated appearance of the diffraction spots is known as 
asterism and it arises in an analogous way to the reflec- 
tion of light from curved mirrors. 

5. 3. 3. 2 Powder method 

The powder method, devised independently by Debye 
and Scherrer, is probably the most generally useful of 
all the X-ray techniques. It employs monochromatic 
radiation and a finely-powdered, or fine-grained poly- 
crystalline, wire specimen. In this case, 0 is the vari- 
able, since the collection of randomly-oriented crystals 
will contain sufficient particles with the correct orien- 
tation to allow reflection from each of the possible 
reflecting planes, i.e. the powder pattern results from 
a series of superimposed rotating crystal patterns. The 
angle between the direct X-ray beam and the reflected 
ray is 20, and consequently each set of crystal planes 
gives rise to a cone of reflected rays of semi-angle 20, 
where 0 is the Bragg angle for that particular set of 
reflecting planes producing the cone. Thus, if a film is 
placed around the specimen, as shown in Figure 5.11, 



Figure 5.10 (a) Laue method of X-ray diffraction. 

(b) Asterisms on a Laue transmission photograph of 
deformed zinc ( after Cahn, 1949). 

the successive diffracted cones, which consist of rays 
from hundreds of grains, intersect the film to produce 
concentric curves around the entrance and exit holes. 
Figure 5.12 shows examples of patterns from bcc and 
fee materials, respectively. 

Precise measurement of the pattern of diffraction 
lines is required for many applications of the powder 
method, but a good deal of information can readily be 
obtained merely by inspection. One example of this 
is in the study of deformed metals, since after defor- 
mation the individual spots on the diffraction rings are 
blurred so much that line-broadening occurs, especially 
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Figure 5.11 Powder method of X-ray diffraction. 
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Figure 5.12 Powder photographs taken in a Philips camera (114 mm radius) of (a) iron with cobalt radiation using an iron filter and (b) aluminium with copper radiation using 
a nickel filter. The high-angle lines are resolved and the separate reflections for X = Kaj and X = Kct 2 are observable.. 
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at high Bragg angles. On low-temperature annealing, 
the cold-worked material will tend to recover and this 
is indicated on the photograph by a sharpening of the 
broad diffraction lines. At higher annealing tempera- 
tures the metal will completely regain its softness by a 
process known as recrystallization (see Chapter 7) and 
this phenomenon is accompanied by the completion 
of the line- sharpening process. With continued anneal- 
ing, the grains absorb each other to produce a structure 
with an overall coarser grain size and, because fewer 
reflections are available to contribute to the diffrac- 
tion cones, the lines on the powder photograph take 
on a spotty appearance. This latter behaviour is some- 
times used as a means of determining the grain size of 
a polycrystalline sample. In practice, an X-ray photo- 
graph is taken for each of a series of known grain sizes 
to form a set of standards, and with them an unknown 
grain size can be determined quite quickly by com- 
paring the corresponding photograph with the set of 


standards. Yet a third use of the powder method as an 
inspection technique is in the detection of a preferred 
orientation of the grains of a polycrystalline aggregate. 
This is because a random orientation of the grains will 
produce a uniformly intense diffraction ring, while a 
preferred orientation, or texture, will concentrate the 
intensity at certain positions on the ring. The details 
of the texture require considerable interpretation and 
are discussed in Chapter 7. 

5. 3. 3. 3 X-ray diffractometry 
In addition to photographic recording, the diffracted 
X-ray beam may be detected directly using a counter 
tube (either Geiger, proportional or scintillation type) 
with associated electrical circuitry. The geometrical 
arrangement of such an X-ray diffractometer is shown 
in Figure 5.13a. A divergent beam of filtered or 
monochromatized radiation impinges on the flat face 
of a powder specimen. This specimen is rotated at 


(a) 



Monochromator 
(Quartz crystal) 



Figure 5.13 Geometry of (a) conventional diffractometer and (b) small-angle scattering diffractometer, (c) chart record of 
diffraction pattern from aluminium powder with copper radiation using nickel filter. 
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precisely one-half of the angular speed of the receiving 
slit so that a constant angle between the incident and 
reflected beams is maintained. The receiving slit is 
mounted in front of the counter on the counter tube 
arm, and behind it is usually fixed a scatter slit to 
ensure that the counter receives radiation only from 
the portion of the specimen illuminated by the primary 
beam. The intensity diffracted at the various angles is 
recorded automatically on a chart of the form shown 
in Figure 5.13c, and this can quickly be analysed for 
the appropriate 0 and d values. 

The technique is widely used in routine chemical 
analysis, since accurate intensity measurements allow 
a quantitative estimate of the various elements in the 
sample to be made. In research, the technique has 
been applied to problems such as the degree of order 
in alloys, the density of stacking faults in deformed 
alloys, elastic constant determination, the study of 
imperfections and preferred orientation. 

5. 3. 3. 4 X-ray topography 

With X-rays it is possible to study individual crys- 
tal defects by detecting the differences in intensity 
diffracted by regions of the crystal near dislocations, 
for example, and more nearly perfect regions of the 
crystal. Figure 5.14a shows the experimental arrange- 
ment schematically in which collimated monochro- 
matic Kot - radiation and photographic recording is 
used. 

Any imperfections give rise to local changes in 
diffracted or transmitted X-ray intensities and, con- 
sequently, dislocations show up as bands of contrast, 
some 5 to 50 pm wide. No magnification is used in 


recording the diffraction image, but subsequent magni- 
fication of up to 500 times may be achieved with high- 
resolution X-ray emulsions. Large areas of the crystal 
to thicknesses of 10-100 pm can be mapped using 
scanning techniques provided the dislocation density 
is not too high (^10 10 m -2 ). 

The X-ray method of detecting lattice defects suf- 
fers from the general limitations that the resolution 
is low and exposure times are long (12 h) although 
very high intensity X-ray sources are now avail- 
able from synchrotrons and are being used increas- 
ingly with very short exposure times (^minutes). By 
comparison, the thin-film electron microscopy method 
(see Section 5.4.2) is capable of revealing dislocations 
with a much higher resolution because the disloca- 
tion image width is 10 nm or less and magnifications 
up to 100000 times are possible. The X-ray method 
does, however, have the great advantage of being able 
to reveal dislocations in crystals which are compara- 
tively thick (~1 mm, cf. 0.1 pm in foils suitable for 
transmission electron microscopy). The technique has 
been used for studying in detail the nature of dis- 
locations in thick single crystals with very low dis- 
location densities, such as found in semiconducting 
materials; Figure 5.14b shows an example of an X-ray 
topograph revealing dislocations in magnesium by this 
technique. 

5.3.4 Typical interpretative procedures for 
diffraction patterns 

5.3.4. 1 Intensity of diffraction 

Many applications of the powder method depend on 

the accurate measurement of either line position or 
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Figure 5.14 (a) Geometry of X-ray topographic technique, (b) topograph from a magnesium single crystal showing 
dislocation loops, g = 0 1 1 0 (after Vale and Smallman, 1977). 
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line intensity. The arrangement of the diffraction lines 
in any pattern is characteristic of the material being 
examined and, consequently, an important practical 
use of the method is in the identification of unknown 
phases. Thus, it will be evident that equation (5.9) 
can indicate the position of the reflected beams, 
as determined by the size and shape of the unit 
cell, but not the intensities of the reflected beams. 
These are determined not by the size of the unit 
cell but by the distribution of atoms within it, 
and while cubic lattices give reflections for every 
possible value of (h 2 + k 2 + l 2 ) all other structures 
give characteristic absences. Studying the indices of 
the ‘absent’ reflections enables different structures to 
be distinguished. 

In calculating the intensity scattered by a given 
atomic structure, we have first to consider the inten- 
sity scattered by one atom, and then go on to consider 
the contribution from all the other atoms in the par- 
ticular arrangement which make up that structure. The 
efficiency of an atom in scattering X-rays is usually 
denoted by /, the atomic scattering factor, which is 
the ratio of amplitude scattered by an atom A a to that 
by a single electron A e . If atoms were merely points, 
their scattering factors would be equal to the number 
of electrons they contain, i.e. to their atomic numbers, 
and the relation 7 a = Z 2 ./ e would hold since intensity 
is proportional to the square of amplitude. However, 
because the size of the atom is comparable to the wave- 
length of X-rays, scattering from different parts of the 
atom is not in phase, and the result is that h < z 2 ./ e . 
The scattering factor, therefore, depends both on angle 
0 and on the wavelength of X-rays used, as shown in 
Figure 5.9, because the path difference for the individ- 
ual waves scattered from the various electrons in the 
atom is zero when 0 = 0 and increases with increasing 
0. Thus, to consider the intensity scattered by a given 
structure, it is necessary to sum up the waves which 
come from all the atoms of one unit cell of that struc- 
ture, since each wave has a different amplitude and 
a different phase angle due to the fact that it comes 
from a different part of the structure. The square of 
the amplitude of the resultant wave, F, then gives the 
intensity, and this may be calculated by using the /- 
values and the atomic coordinates of each atom in the 
unit cell. It can be shown that a general formula for 
the intensity is 

I a |F| 2 = [fi cos27t(/dci + kyi + lz\) 

+ / 2 cos 2 ir{hx 2 + ky 2 + lz 2 ) + • • d 2 

+ [/ 1 sin 2n(hx\ + ky x + l Z\) 

+ f 2 sin 2n(hx 2 + ky 2 + Izi) H ] 2 

(5.10) 

where x\, y u zu x 2 , yi, Z 2 , etc., are the coordinates 
of those atoms having scattering factors / 1 , / 2 , etc., 
respectively, and hkl are the indices of the reflection 
being computed. For structures having a centre of 


symmetry, which includes most metals, the expression 
is much simpler because the sine terms vanish. 

This equation may be applied to any structure, 
but to illustrate its use let us examine a pure metal 
crystallizing in the bcc structure. From Figure 2.11c 
it is clear that the structure has identical atoms (i.e. 
fi = / 2 ) at the coordinates (000) and \ so that 
equation (5.10) becomes: 

I a f 2 [ cos 271.0 + cos 2n(h/2 + k/2 + / /2)] 2 

= / 2 [1 +COS7T(/t + £ + /)] 2 (5.11) 

It then follows that I is equal to zero for every 
reflection having ( h + k + l ) an odd number. The 
significance of this is made clear if we consider 
in a qualitative way the 100 reflection shown in 
Figure 5.15a. To describe a reflection as the first-order 
reflection from (100) planes implies that there is 
1A, phase-difference between the rays reflected from 
planes A and those reflected from planes A'. However, 
the reflection from the plane B situated half-way 
between A and A! will be k/2 out of phase with that 
from plane A, so that complete cancellation of the 100 
reflected ray will occur. The 100 reflection is therefore 
absent, which agrees with the prediction made from 
equation (5.11) that the reflection is missing when 
(h + k + l) is an odd number. A similar analysis shows 
that the 20 0 reflection will be present (Figure 5.15b), 
since the ray from the B plane is now exactly Ik out 
of phase with the rays from A and A'. In consequence, 
if a diffraction pattern is taken from a material having 
a bcc structure, because of the rule governing the sum 
of the indices, the film will show diffraction lines 
almost equally spaced with indices N = 2, (110); 
4, (200); 6, (211); 8, (220); . . ., as shown in 
Figure 5.12a. Application of equation (5.10) to a pure 




Figure 5.15 (a) 1 0 0 reflection from bcc cell showing 
interference of diffracted rays, (b) 2 0 0 reflection showing 
reinforcement (after Barrett, 1952; courtesy of 
McGraw-Hill). 
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metal with fee structure shows that ‘absent’ reflections 
will occur when the indices of that reflection are 
mixed, i.e. when they are neither all odd nor all even. 
Thus, the corresponding diffraction pattern will contain 
lines according to N = 3, 4, 8, 1 1, 12, 16, 19, 20, etc; 
and the characteristic feature of the arrangement is 
a sequence of two lines close together and one line 
separated, as shown in Figure 5.12b. 

Equation (5.10) is the basic equation used for deter- 
mining unknown structures, since the determination of 
the atomic positions in a crystal is based on this rela- 
tion between the coordinates of an atom in a unit cell 
and the intensity with which it will scatter X-rays. 

5. 3. 4. 2 Determination of lattice parameters 
Perhaps the most common use of the powder method 
is in the accurate determination of lattice parame- 
ters. From the Bragg law we have the relation a = 
Xy/N/ 2 sin 6 which, because both k and N are known 
and 6 can be measured for the appropriate reflection, 
can be used to determine the lattice parameter of a 
material. Several errors are inherent in the method, 
however, and the most common include shrinkage of 
the film during processing, eccentricity of the speci- 
men and the camera, and absorption of the X-rays in 
the sample. These errors affect the high-angle diffrac- 
tion lines least and, consequently, the most accurate 
parameter value is given by determining a value of 
a from each diffraction line, plotting it on a graph 
against an angular function 1 of the cos 2 0-type and then 
extrapolating the curve to 0 = 90°. 

The determination of precision lattice parameters is 
of importance in many fields of materials science, par- 
ticularly in the study of thermal expansion coefficients, 
density determinations, the variation of properties with 
composition, precipitation from solid solution, and 
thermal stresses. At this stage it is instructive to con- 
sider the application of lattice parameter measurements 
to the determination of phase boundaries in equilibrium 
diagrams, since this illustrates the general usefulness 
of the technique. The diagrams shown in Figures 5.16a 
and 5.16b indicate the principle of the method. A varia- 
tion of alloy composition within the single-phase field, 
a, produces a variation in the lattice parameter, a, 
since solute B , which has a different atomic size to 
the solvent A, is being taken into solution. However, 
at the phase boundary solvus this variation in a ceases, 
because at a given temperature the composition of the 
a-phase remains constant in the two-phase field, and 
the marked discontinuity in the plot of lattice parameter 
versus composition indicates the position of the phase 
boundary at that temperature. The change in solid solu- 
bility with temperature may then be obtained, either by 
taking diffraction photographs in a high-temperature 

1 Nelson and Riley suggest the function 

I cos 2 6 cos 2 0\ 

^ sin<9 + 0 J 




Figure 5.16 (a) and (b) Phase-boundary determination 
using lattice parameter measurements. 

camera at various temperatures or by quenching the 
powder sample from the high temperature to room 
temperature (in order to retain the high temperature 
state of solid solution down to room temperature) and 
then taking a powder photograph at room temperature. 

5. 3. 4. 3 Line-broadening 

Diffraction lines are not always sharp because of 
various instrumental factors such as slit size, specimen 
condition, and spread of wavelengths, but in addition 
the lines may be broadened as a result of lattice strain 
in the region of the crystal diffracting and also its 
limited dimension. Strain gives rise to a variation of the 
interplanar spacing Ad and hence diffraction occurs 
over a range AO and the breadth due to strain is then 

y0 s = 37tan0 (5.12) 

where r] is the strain distribution. If the dimension 
of the crystal diffracting the X-rays is small, 2 then 
this also gives rise to an appreciable ‘particle- size’ 
broadening given by the Scherrer formula 

/3 p = k/t cos 6 (5.13) 

2 The optical analogue of this effect is the broadening of 
diffraction lines from a grating with a limited number of 
lines. 
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where t is the effective particle size. In practice this 
size is the region over which there is coherent diffrac- 
tion and is usually defined by boundaries such as dislo- 
cation walls. It is possible to separate the two effects 
by plotting the experimentally measured broadening 
cos 0/A, against sin 0/X, when the intercept gives a 
measure of t and the slope r\. 

5. 3.4.4 Small-angle scattering 
The scattering of intensity into the low-angle region 
(e = 20 < 10°) arises from the presence of inhomo- 
geneities within the material being examined (such as 
small clusters of solute atoms), where these inhomo- 
geneities have dimensions only 10 to 100 times the 
wavelength of the incident radiation. The origin of 
the scattering can be attributed to the differences in 
electron density between the heterogeneous regions 
and the surrounding matrix, 1 so that precipitated par- 
ticles afford the most common source of scattering; 
other heterogeneities such as dislocations, vacancies 
and cavities must also give rise to some small-angle 
scattering, but the intensity of the scattered beam will 
be much weaker than this from precipitated particles. 
The experimental arrangement suitable for this type of 
study is shown in Figure 5.13b. 

Interpretation of much of the small- angle scatter 
data is based on the approximate formula derived by 
Guinier, 

/ = Mn 2 7 e exp [-4;r 2 e 2 /? 2 /3X 2 ] (5. 14) 

where M is the number of scattering aggregates, or 
particles, in the sample, n represents the difference in 
number of electrons between the particle and an equal 
volume of the surrounding matrix, R is the radius of 
gyration of the particle, 7 e is the intensity scattered 
by an electron, s is the angle of scattering and X is 
the wavelength of X-rays. From this equation it can 
be seen that the intensity of small-angle scattering is 
zero if the inhomogeneity, or cluster, has an electron 
density equivalent to that of the surrounding matrix, 
even if it has quite different crystal structure. On a 
plot of log 10 / as a function of £ 2 , the slope near the 
origin, s = 0, is given by 

P = ~(4n 2 /3X 2 )R 2 \og l0 e 

which for Cu Ka radiation gives the radius of gyration 
of the scattering aggregate to be 

R = 0.0645 x P 1/2 nm (5.15) 

It is clear that the technique is ideal for studying 
regions of the structure where segregation on too fine 
a scale to be observable in the light microscope has 
occurred, e.g. the early stages of phase precipitation 

^he halo around the moon seen on a clear frosty night is 
the best example, obtained without special apparatus, of the 
scattering of light at small angles by small particles. 


(see Chapter 8), and the aggregation of lattice defects 
(see Chapter 4). 

5. 3. 4. 5 The reciprocal lattice concept 
The Bragg law shows that the conditions for diffraction 
depend on the geometry of sets of crystal planes. To 
simplify the more complex diffraction problems, use 
is made of the reciprocal lattice concept in which the 
sets of lattice planes are replaced by a set of points, 
this being geometrically simpler. 

The reciprocal lattice is constructed from the real 
lattice by drawing a line from the origin normal to 
the lattice plane hkl under consideration of length, 
d*, equal to the reciprocal of the interplanar spacing 
dhkh The construction of part of the reciprocal lattice 
from a face-centred cubic crystal lattice is shown in 
Figure 5.17. 

Included in the reciprocal lattice are the points 
which correspond not only to the true lattice planes 
with Miller indices (hkl) but also to the fictitious 
planes (nh, nk, nl) which give possible X-ray 
reflections. The reciprocal lattice therefore corresponds 
to the diffraction spectrum possible from a particular 
crystal lattice and, since a particular lattice type is 
characterized by ‘absent’ reflections the corresponding 
spots in the reciprocal lattice will also be missing. It 
can be deduced that a fee Bravais lattice is equivalent 
to a bcc reciprocal lattice, and vice versa. 

A simple geometrical construction using the recipro- 
cal lattice gives the conditions that correspond to Bragg 
reflection. Thus, if a beam of wavelength X is incident 
on the origin of the reciprocal lattice, then a sphere of 
radius l/X drawn through the origin will intersect those 
points which correspond to the reflecting planes of a 
stationary crystal. This can be seen from Figure 5.18, 
in which the reflecting plane AB has a reciprocal point 
at d*. If d* lies on the surface of the sphere of radius 
l/X then 

d* = 1 /d hkl = 2sin 0/X (5.16) 
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Figure 5.18 Construction of the Ewald reflecting sphere. 



Figure 5.19 Principle of the power method. 


and the Bragg law is satisfied; the line joining the 
origin to the operating reciprocal lattice spot is usually 
referred to as the g-vector. It will be evident that at 
any one setting of the crystal, few, if any, points will 
touch the sphere of reflection. This is the condition 
for a stationary single crystal and a monochromatic 
beam of X-rays, when the Bragg law is not obeyed 
except by chance. To ensure that the Bragg law is 
satisfied the crystal has to be rotated in the beam, since 
this corresponds to a rotation of the reciprocal lattice 
about the origin when each point must pass through the 
reflection surface. The corresponding reflecting plane 
reflects twice per revolution. 

To illustrate this feature let us re-examine the pow- 
der method. In the powder specimen, the number of 
crystals is sufficiently large that all possible orienta- 
tions are present and in terms of the reciprocal lattice 
construction we may suppose that the reciprocal lat- 
tice is rotated about the origin in all possible direc- 
tions. The locus of any one lattice point during such 
a rotation is, of course, a sphere. This locus-sphere 
will intersect the sphere of reflection in a small cir- 
cle about the axis of the incident beam as shown in 
Figure 5.19, and any line joining the centre of the 
reflection sphere to a point on this small circle is a pos- 
sible direction for a diffraction maximum. This small 


circle corresponds to the powder halo discussed previ- 
ously. From Figure 5.19 it can be seen that the radius 
of the sphere describing the locus of the reciprocal 
lattice point (hkl) is 1 /d^ki) and that the angle of devi- 
ation of the diffracted beam 20 is given by the relation 

(2/A.) sin 9 = l/d m) 
which is the Bragg condition. 


5.4 Analytical electron microscopy 

5.4.1 Interaction of an electron beam with a 
solid 

When an electron beam is incident on a solid specimen 
a number of interactions take place which generate use- 
ful structural information. Figure 5.20 illustrates these 
interactions schematically. Some of the incident beam 
is back-scattered and some penetrates the sample. If 
the specimen is thin enough a significant amount is 
transmitted, with some electrons elastically scattered 
without loss of energy and some inelastically scattered. 
Interaction with the atoms in the specimen leads to the 
ejection of low-energy electrons and the creation of 
X-ray photons and Auger electrons, all of which can 
be used to characterize the material. 

The two inelastic scattering mechanisms important 
in chemical analysis are (1) excitation of the electron 
gas plasmon scattering, and (2) single-electron scat- 
tering. In plasmon scattering the fast electron excites 
a ripple in the plasma of free electrons in the solid. 
The energy of this ‘plasmon’ depends only on the vol- 
ume concentration of free electrons n in the solid and 
given by E v = [ne 2 /m] l/1 . Typically E p , the energy 
loss suffered by the fast electron is ~15 eV and the 
scattering intensity/unit solid angle has an angular half- 
width given by 0 E = E p /2E 0 , where E 0 is the incident 
voltage; 0 E is therefore ~10 -4 radian. The energy 



Figure 5.20 Scattering of incident electrons by thin foil. 
With a bulk specimen the transmitted, elastic and inelastic 
scattered beams are absorbed. 
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of the plasmon is converted very quickly into atom 
vibrations (heat) and the mean-free path for plasmon 
excitation is small, ~50-150 nm. With single- electron 
scattering energy may be transferred to single elec- 
trons (rather than to the large number ~10 5 involved 
in plasmon excitation) by the incident fast electrons. 
Lightly-bound valency electrons may be ejected, and 
these electrons can be used to form secondary images 
in SEM; a very large number of electrons with ener- 
gies up to ~50 eV are ejected when a high-energy 
electron beam strikes a solid. The useful collisions are 
those where the single electron is bound. There is a 
minimum energy required to remove the single elec- 
tron, i.e. ionization, but provided the fast electron gives 
the bound electron more than this minimum amount, 
it can give the bound electron any amount of energy, 
up to its own energy (e.g. 100 keV). Thus, instead of 
the single-electron excitation process turning up in the 
energy loss spectrum of the fast electron as a peak, 
as happens with plasmon excitation, it turns up as an 
edge. Typically, the mean free path for inner shell ion- 
ization is several micrometres and the energy loss can 
be several keV. The angular half-width of scattering 
is given by AE/2E 0 . Since the energy loss A E can 
vary from ~10 eV to tens of keV the angle can vary 
upwards from 1CT 4 radian (see Figure 5.36). 

A plasmon, once excited, decays to give heat, which 
is not at all useful. In contrast, an atom which has had 
an electron removed from it decays in one of two ways, 
both of which turn out to be very useful in chemical 
analysis leading to the creation of X-rays and Auger 
electrons. The first step is the same for both cases. An 
electron from outer shell, which therefore has more 
energy than the removed electron, drops down to fill 
the hole left by the removal of the bound electron. Its 
extra energy, A E, equal to the difference in energy 
between the two levels involved and therefore abso- 
lutely characteristic of the atom, must be dissipated. 
This may happen in two ways: (1) by the creation 
of a photon whose energy, hv , equals the energy dif- 
ference A E. For electron transitions of interest, A E, 
and therefore hv , is such that the photon is an X-ray, 
(2) by transferring the energy to a neighbouring elec- 
tron, which is then ejected from the atom. This is an 
‘Auger’ electron. Its energy when detected will depend 
on the original energy difference A E minus the binding 
energy of the ejected electron. Thus the energy of the 
Auger electron depends on three atomic levels rather 
than two as for emitted photons. The energies of the 
Auger electrons are sufficiently low that they escape 
from within only about 5 nm of the surface. This is 
therefore a surface analysis technique. The ratio of 
photon -Auger yield is called the fluorescence ratio co, 
and depends on the atom and the shells involved. For 
the X-shell, co is given by co K = X K /(A K + X K ), where 
X K and A k are, respectively, the number of X-ray pho- 
tons and Auger electrons emitted. A K is independent 
of atomic number Z, and X K is proportional to Z 4 so 
that co K Z 4 /(a + Z 4 ), where a = 1.12xl0 6 . Light ele- 
ments and outer shells (L-lines) have lower yields; for 


X-series transitions co K varies from a few per cent for 
carbon up to >90% for gold. 

5.4.2 The transmission electron microscope 
(TEM) 

Section 5.2.1 shows that to increase the resolving 
power of a microscope it is necessary to employ shorter 
wavelengths. For this reason the electron microscope 
has been developed to allow the observation of struc- 
tures which have dimensions down to less than 1 nm. 
An electron microscope consists of an electron gun 
and an assembly of lenses all enclosed in an evacuated 
column. A very basic system for a transmission elec- 
tron microscope is shown schematically in Figure 5.21. 
The optical arrangement is similar to that of the glass 
lenses in a projection- type light microscope, although 
it is customary to use several stages of magnification 
in the electron microscope. The lenses are usually of 
the magnetic type, i.e. current-carrying coils which are 
completely surrounded by a soft iron shroud except 
for a narrow gap in the bore, energized by d.c. and, 
unlike the lenses in a light microscope, which have 
fixed focal lengths, the focal length can be controlled 
by regulating the current through the coils of the lens. 



Figure 5.21 Schematic arrangement of a basic transmission 
electron microscope system. 
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This facility compensates for the fact that it is difficult 
to move the large magnetic lenses in the evacuated 
column of the electron microscope in an analogous 
manner to the glass lenses in a light microscope. 

The condenser lenses are concerned with collimating 
the electron beam and illuminating the specimen which 
is placed in the bore of the objective lens. The function 
of the objective lens is to form a magnified image of up 
to about 40 x in the object plane of the intermediate, 
or first projector lens. A small part of this image then 
forms the object for the first projector lens, which gives 
a second image, again magnified in the object plane of 
the second projector lens. The second projector lens 
is capable of enlarging this image further to form a 
final image on the fluorescent viewing screen. This 
image, magnified up to 100 000 x may be recorded 
on a photographic film beneath the viewing screen. A 
stream of electrons can be assigned a wavelength X 
given by the equation X = h/mv, where h is Planck’s 
constant and mv is the and hence to the voltage applied 
to the electron gun, according to the approximate 
relation 

X = ^(U/V^m (5.17) 

and, since normal operating voltages are between 50 
and 100 kV, the value of X used varies from 0.0054 nm 
to 0.0035 nm. With a wavelength of 0.005 nm if one 
could obtain a value of (/x sin a) for electron lenses 
comparable to that for optical lenses, i.e. 1.4, it would 
be possible to see the orbital electrons. However, mag- 
netic lenses are more prone to spherical and chromatic 
aberration than glass lenses and, in consequence, small 
apertures, which correspond to a-values of about 0.002 
radian, must be used. As a result, the resolution of 
the electron microscope is limited to about 0.2 nm. It 
will be appreciated, of course, that a variable magni- 
fication is possible in the electron microscope without 
relative movement of the lenses, as in a light micro- 
scope, because the depth of focus of each image, being 
inversely proportional to the square of the numerical 
aperture, is so great. 

5.4.3 The scanning electron microscope 

The surface structure of a metal can be studied in the 
TEM by the use of thin transparent replicas of the sur- 
face topography. Three different types of replica are 
in use, (1) oxide, (2) plastic, and (3) carbon replicas. 
However, since the development of the scanning elec- 
tron microscope (SEM) it is very much easier to study 
the surface structure directly. 

A diagram of the SEM is shown in Figure 5.22. The 
electron beam is focused to a spot ~10 nm diameter 
and made to scan the surface in a raster. Electrons from 
the specimen are focused with an electrostatic elec- 
trode on to a biased scintillator. The light produced is 
transmitted via a Perspex light pipe to a photomulti- 
plier and the signal generated is used to modulate the 
brightness of an oscilloscope spot which traverses a 
raster in exact synchronism with the electron beam at 



Figure 5.22 Schematic diagram of a basic scanning electron 
microscope (courtesy of Cambridge Instrument Co.). 

the specimen surface. The image observed on the oscil- 
loscope screen is similar to the optical image and the 
specimen is usually tilted towards the collector at a low 
angle (<30° ) to the horizontal, for general viewing. 

As initially conceived, the SEM used backscat- 
tered electrons (with E ~ 30 kV which is the inci- 
dent energy) and secondary electrons (E ~ 100 eV) 
which are ejected from the specimen. Since the sec- 
ondary electrons are of low energy they can be bent 
round corners and give rise to the topographic con- 
trast. The intensity of backscattered (BS) electrons is 
proportional to atomic number but contrast from these 
electrons tends to be swamped because, being of higher 
energy, they are not so easily collected by the normal 
collector system used in SEMs. If the secondary elec- 
trons are to be collected a positive bias of ~200 V is 
applied to the grid in front of the detector; if only the 
back-scattered electrons are to be collected the grid is 
biased negatively to ~200 V. 

Perhaps the most significant development in recent 
years has been the gathering of information relating to 
chemical composition. As discussed in Section 5.4.1, 
materials bombarded with high-energy electrons can 
give rise to the emissions of X-rays characteristic of 
the material being bombarded. The X-rays emitted 
when the beam is stopped on a particular region of 
the specimen may be detected either with a solid- 
state (Li-drifted silicon) detector which produces a 
voltage pulse proportional to the energy of the incident 
photons (energy-dispersive method) or with an X-ray 
spectrometer to measure the wavelength and intensity 
(wavelength-dispersive method). The microanalysis of 
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materials is presented in Section 5.4.5. Alternatively, 
if the beam is scanned as usual and the intensity of the 
X-ray emission, characteristic of a particular element, 
is used to modulate the CRT, an image showing the 
distribution of that element in the sample will result. X- 
ray images are usually very ‘noisy’ because the X-ray 
production efficiency is low, necessitating exposures a 
thousand times greater than electron images. 

Collection of the back- scattered (BS) electrons with 
a specially located detector on the bottom of the lens 
system gives rise to some exciting applications and 
opens up a completely new dimension for SEM from 
bulk samples. The BS electrons are very sensitive to 
atomic number Z and hence are particularly impor- 
tant in showing contrast from changes of composi- 
tion, as illustrated by the image from a silver alloy 
in Figure 5.23. This atomic number contrast is par- 
ticularly effective in studying alloys which normally 


are difficult to study because they cannot be etched. 
The intensity of back- scattered electrons is also sen- 
sitive to the orientation of the incident beam relative 
to the crystal. This effect will give rise to ‘orienta- 
tion’ contrast from grain to grain in a poly crystalline 
specimen as the scan crosses several grains. In addi- 
tion, the effect is also able to provide crystallographic 
information from bulk specimens by a process known 
as electron channelling. As the name implies, the elec- 
trons are channelled between crystal planes and the 
amount of channelling per plane depends on its pack- 
ing and spacing. If the electron beam impinging on a 
crystal is rocked through a large angle then the amount 
of channelling will vary with angle and hence the BS 
image will exhibit contrast in the form of electron 
channelling patterns which can be used to provide crys- 
tallographic information. Figure 5.24 shows the ‘orien- 
tation’ or channelling contrast exhibited by a Fe-3%Si 



Figure 5.23 Back-cattered electron image by atomic number contrast from 70Ag-30Cu alloy showing (a) a-dendrites + 
eutectic and (b) eutectic (courtesy of B. W. Hutchinson ). 



Figure 5.24 (a) Back-scattered electron image and (b) associated channelling pattern, from secondary recrystallized Fe-3%Si 
(courtesy ofB. W. Hutchinson). 
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specimen during secondary recrystallization (a process 
used for transformer lamination production) and the 
channelling pattern can be analysed to show that the 
new grain possesses the Goss texture. Electron chan- 
nelling occurs only in relatively perfect crystals and 
hence the degradation of electron channelling patterns 
may be used to monitor the level of plastic strain, for 
example to map out the plastic zone around a fatigue 
crack as it develops in an alloy. 

The electron beam may also induce electrical effects 
which are of importance particularly in semiconductor 
materials. Thus a 30 kV electron beam can generate 
some thousand excess free electrons and the equiv- 
alent number of ions (‘holes’), the vast majority of 
which recombine. In metals, this recombination pro- 
cess is very fast (1 ps) but in semiconductors may be a 
few seconds depending on purity. These excess current 
carriers will have a large effect on the limited conduc- 
tivity. Also the carriers generated at one point will 
diffuse towards regions of lower carrier concentration 
and voltages will be established whenever the carriers 
encounter regions of different chemical composition 
(e.g. impurities around dislocations). The conductiv- 
ity effect can be monitored by applying a potential 
difference across the specimen from an external bat- 
tery and using the magnitude of the resulting current 
to modulate the CRT brightness to give an image of 
conductivity variation. 

The voltage effect arising from different carrier con- 
centrations or from accumulation of charge on an insu- 
lator surface or from the application of an external 
electromotive force can modify the collection of the 
emitted electrons and hence give rise to voltage con- 
trast. Similarly, a magnetic field arising from ferromag- 
netic domains, for example, will affect the collection 
efficiency of emitted electrons and lead to magnetic 
field contrast. 

The secondary electrons, i.e. lightly-bound electrons 
ejected from the specimen which give topographical 
information, are generated by the incident electrons, 
by the back- scattered electrons and by X-rays. The 
resolution is typically ~10 nm at 20 kV for medium 
atomic weight elements and is limited by spreading 
of electrons as they penetrate into the specimen. The 
back-scattered electrons are also influenced by beam 
spreading and for a material of medium atomic weight 
the resolution is ~ 100 nm. The specimen current mode 
is limited both by spreading of the beam and the noise 
of electronic amplification to a spatial resolution of 
500 nm and somewhat greater values ~1 pm apply to 
the beam-induced conductivity and X-ray modes. 

5.4.4 Theoretical aspects of TEM 

5.4.4. 1 Imaging and diffraction 
Although the examination of materials may be carried 
out with the electron beam impinging on the surface at 
a ‘glancing incidence’, most electron microscopes are 
aligned for the use of a transmission technique, since 
added information on the interior of the specimen may 


be obtained. In consequence, the thickness of the metal 
specimen has to be limited to below a micrometre, 
because of the restricted penetration power of the 
electrons. Three methods now in general use for 
preparing such thin films are (1) chemical thinning, 
(2) electropolishing, and (3) bombarding with a beam 
of ions at a potential of about 3 kV. Chemical thinning 
has the disadvantage of preferentially attacking either 
the matrix or the precipitated phases, and so the 
electropolishing technique is used extensively to 
prepare thin metal foils. Ion beam thinning is quite 
slow but is the only way of preparing thin ceramic 
and semiconducting specimens. 

Transmission electron microscopy provides both 
image and diffraction information from the same small 
volume down to 1 pm in diameter. Ray diagrams for 
the two modes of operation, imaging and diffrac- 
tion, are shown in Figure 5.25. Diffraction contrast 1 
is the most common technique used and, as shown 
in Figure 5.25a, involves the insertion of an objective 
aperture in the back focal plane, i.e. in the plane in 
which the diffraction pattern is formed, to select either 
the directly-transmitted beam or a strong diffracted 
beam. Images obtained in this way cannot possi- 
bly contain information concerning the periodicity of 



(a) (b) 

Figure 5.25 Schematic ray diagrams for (a) imaging and 
(b) diffraction. 


Another imaging mode does allow more than one beam to 
interfere in the image plane and hence crystal periodicity 
can be observed; the larger the collection angle, which is 
generally limited by lens aberrations, the smaller the 
periodicity that can be resolved. Interpretation of this direct 
imaging mode, while apparently straightforward, is still 
controversial, and will not be covered here. 
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the crystal, since this information is contained in the 
spacing of diffraction maxima and the directions of 
diffracted beams, information excluded by the objec- 
tive aperture. 

Variations in intensity of the selected beam is the 
only information provided. Such a mode of imaging, 
carried out by selecting one beam in TEM, is unusual 
and the resultant images cannot be interpreted simply 
as high-magnification images of periodic objects. In 
formulating a suitable theory it is necessary to consider 
what factors can influence the intensity of the directly- 
transmitted beam and the diffracted beams. The obvi- 
ous factors are (1) local changes in scattering factor, 
e.g. particles of heavy metal in light metal matrix, 
(2) local changes in thickness, (3) local changes in ori- 
entation of the specimen, or (4) discontinuities in the 
crystal planes which give rise to the diffracted beams. 
Fortunately, the interpretation of any intensity changes 
is relatively straightforward if it is assumed that there 
is only one strong diffracted beam excited. Moreover, 
since this can be achieved quite easily experimentally, 
by orienting the crystal such that strong diffraction 
occurs from only one set of crystal planes, virtually 
all TEM is carried out with a two-beam condition: 
a direct and a diffracted beam. When the direct, or 
transmitted, beam only is allowed to contribute to 
the final image by inserting a small aperture in the 
back focal plane to block the strongly diffracted ray, 
then contrast is shown on a bright background and is 
known as bright-field imaging. If the diffracted ray 
only is allowed through the aperture by tilting the 
incident beam then contrast on a dark background is 
observed and is known as dark-field imaging. These 
two arrangements are shown in Figure 5.26. 

A dislocation can be seen in the electron microscope 
because it locally changes the orientation of the crystal, 
thereby altering the diffracted intensity. This is illus- 
trated in Figure 5.27. Any region of a grain or crystal 
which is not oriented at the Bragg angle, i.e. 6 > 0 B , 
is not strongly diffracting electrons. However, in the 
vicinity of the dislocation the lattice planes are tilted 
such that locally the Bragg law is satisfied and then 



Figure 5.26 Schematic diagram illustrating (a) bright-field 
and (b) dark- field image formation. 


Incident beam 



Figure 5.27 Mechanism of diffraction contrast: the planes 
to the RHS of the dislocation are bent so that they closely 
approach the Bragg condition and the intensity of the direct 
beam emerging from the crystal is therefore reduced. 

strong diffraction arises from near the defect. These 
diffracted rays are blocked by the objective aperture 
and prevented from contributing to the final image. 
The dislocation therefore appears as a dark line (where 
electrons have been removed) on a bright background 
in the bright-field picture. 

The success of transmission electron microscopy 
(TEM) is due, to a great extent, to the fact that it is 
possible to define the diffraction conditions which give 
rise to the dislocation contrast by obtaining a diffrac- 
tion pattern from the same small volume of crystal (as 
small as 1 pm diameter) as that from which the elec- 
tron micrograph is taken. Thus, it is possible to obtain 
the crystallographic and associated diffraction infor- 
mation necessary to interpret electron micrographs. To 
obtain a selected area diffraction pattern (SAD) an 
aperture is inserted in the plane of the first image so 
that only that part of the specimen which is imaged 
within the aperture can contribute to the diffraction pat- 
tern. The power of the diffraction lens is then reduced 
so that the back focal plane of the objective is imaged, 
and then the diffraction pattern, which is focused in 
this plane, can be seen after the objective aperture is 
removed. 

The usual type of transmission electron diffraction 
pattern from a single crystal region is a cross-grating 
pattern of the form shown in Figure 5.28. The simple 
explanation of the pattern can be given by considering 
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Figure 5.28 fee cross- grating patterns (a) [0 0 1], (b) [1 0 1] and (c) [1 1 1]. 


the reciprocal lattice and reflecting sphere construc- 
tion commonly used in X-ray diffraction. In electron 
diffraction, the electron wavelength is extremely short 
(X = 0.0037 nm at 100 kV) so that the radius of the 
Ewald reflecting sphere is about 2.5 nm -1 , which is 
about 50 times greater than g, the reciprocal lattice 
vector. Moreover, because X is small the Bragg angles 
are also small (about 10 -2 radian or | for low-order 
reflections) and hence the reflection sphere may be 
considered as almost planar in this vicinity. If the elec- 
tron beam is closely parallel to a prominent zone axis 
of the crystal then several reciprocal points (somewhat 
extended because of the limited thickness of the foil) 
will intersect the reflecting sphere, and a projection of 
the prominent zone in the reciprocal lattice is obtained, 
i.e. the SAD pattern is really a photograph of a recip- 
rocal lattice section. Figure 5.28 shows some standard 
cross-grating for fee crystals. Because the Bragg angle 
for reflection is small (~ \ ) only those lattice planes 
which are almost vertical, i.e. almost parallel to the 
direction of the incident electron beam, are capable 
of Bragg-diffracting the electrons out of the objective 
aperture and giving rise to image contrast. Moreover, 
because the foil is buckled or purposely tilted, only 
one family of the various sets of approximately ver- 
tical lattice planes will diffract strongly and the SAD 
pattern will then show only the direct beam spot and 
one strongly diffracted spot (see insert Figure 5.40). 
The indices g of the crystal planes ( hkl ) which are 
set at the Bragg angle can be obtained from the SAD. 
Often the planes are near to, but not exactly at, the 
Bragg angle and it is necessary to determine the precise 
deviation which is usually represented by the param- 
eter 5, as shown in the Ewald sphere construction in 
Figure 5.29. The deviation parameter s is determined 
from Kikuchi lines, observed in diffraction patterns 
obtained from somewhat thicker areas of the specimen, 
which form a pair of bright and dark lines associated 
with each reflection, spaced \g\ apart. 

The Kikuchi lines arise from inelastically- scattered 
rays, originating at some point P in the specimen (see 
Figure 5.30), being subsequently Bragg-diffracted. 
Thus, for the set of planes in Figure 5.30a, those 
electrons travelling in the directions PQ and PR will 
be Bragg-diffracted at Q and R and give rise to rays 
in the directions QQ' and RR ; . Since the electrons 
in the beam RR' originate from the scattered ray 
PR, this beam will be less intense than QQ', which 
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Figure 5.29 Schematic diagram to illustrate the 
determination of s at the symmetry position, together with 
associated diffraction pattern. 


contains electrons scattered through a smaller angle at 
P. Because P is a spherical source this rediffraction at 
points such as Q and R gives rise to cones of rays 
which, when they intersect the film, approximate to 
straight lines. 

The selection of the diffracting conditions used 
to image the crystal defects can be controlled using 
Kikuchi lines. Thus the planes (hkl) are at the Bragg 
angle when the corresponding pair of Kikuchi lines 
passes through 000 and gnu, i.e. 5 = 0. Tilting of the 
specimen so that this condition is maintained (which 
can be done quite simply, using modem double-tilt 
specimen stages) enables the operator to select a spec- 
imen orientation with a close approximation to two- 
beam conditions. Tilting the specimen to a particular 
orientation, i.e. electron beam direction, can also be 
selected using the Kikuchi lines as a ‘navigational’ 
aid. The series of Kikuchi lines make up a Kikuchi 
map, as shown in Figure 5.30b, which can be used to 
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Figure 5.30 Kikuchi lines, (a) Formation of and (b) from 
fee crystal forming a Kikuchi map. 

tilt from one pole to another (as one would use an 
Underground map). 

5. 4. 4. 2 Convergent beam diffraction patterns 
When a selected area diffraction pattern is taken with 
a convergent beam of electrons, the resultant pattern 
contains additional structural information. A ray dia- 
gram illustrating the formation of a convergent beam 
diffraction pattern (CBDP) is shown in Figure 5.31a. 
The discs of intensity which are formed in the back 
focal plane contain information which is of three types: 

1. Fringes within discs formed by strongly diffracted 

beams. If the crystal is tilted to 2-beam conditions, 

these fringes can be used to determine the specimen 

thickness very accurately. 


2. High- angle information in the form of fine lines 
(somewhat like Kikuchi lines) which are visible 
in the direct beam and in the higher-order Laue 
zones (HOLZ). These HOLZ are visible in a pattern 
covering a large enough angle in reciprocal space. 
The fine line pattern can be used to measure the 
lattice parameter to 1 in 10 1 * * 4 . Figure 5.31b shows 
an example of HOLZ lines for a silicon crystal 
centred [1 1 1]. Pairing a dark line through the zero- 
order disc with its corresponding bright line through 
the higher-order disc allows the lattice parameter to 
be determined, the distance between the pair being 
sensitive to the temperature, etc. 

3. Detailed structure both within the direct beam and 
within the diffracted beams which show certain 
well-defined symmetries when the diffraction pat- 
tern is taken precisely along an important zone axis. 
The patterns can therefore be used to give crystal 
structure information, particularly the point group 
and space group. This information, together with 
the chemical composition from EELS or EDX, and 
the size of the unit cell from the indexed diffraction 
patterns can be used to define the specific crys- 
tal structure, i.e. the atomic positions. Figure 5.31c 
indicates the threefold symmetry in a CBDP from 
silicon taken along the [111] axis. 

5. 4. 4. 3 Higher- voltage electron microscopy 
The most serious limitation of conventional transmis- 
sion electron microscopes (CTEM) is the limited thick- 
ness of specimens examined (50-500 nm). This makes 
preparation of samples from heavy elements difficult, 
gives limited containment of particles and other struc- 
tural features within the specimen, and restricts the 
study of dynamical processes such as deformation, 
annealing, etc., within the microscope. However, the 
usable specimen thickness is a function of the acceler- 
ating voltage and can be increased by the use of higher 
voltages. Because of this, higher-voltage microscopes 
(HVEM) have been developed. 

The electron wavelength X decreases rapidly with 
voltage and at 1000 kV the wavelength X ~ 0.001 nm. 
The decrease in X produces corresponding decreases 
in the Bragg angles 6 , and hence the Bragg angles at 
1000 kV are only about one third of their correspond- 
ing values at 100 kV. One consequence of this is that 
an additional projector lens is usually included in high- 
voltage microscope. This is often called the diffraction 
lens and its purpose is to increase the diffraction cam- 
era length so that the diffraction spots are more widely 
spaced on the photographic plate. 

The principal advantages of HVEM are: (1) an 
increase in usable foil thickness and (2) a reduced ion- 
ization damage rate in ionic, polymer and biological 
specimens. The range of materials is therefore widened 
and includes (1) materials which are difficult to pre- 
pare as thin foils, such as tungsten and uranium and 
(2) materials in which the defect being studied is too 
large to be conveniently included within a 100 kV 
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Figure 5.31 (a) Schematic formation of convergent beam diffraction pattern in the backfocal plane of the objective lens, 

(b) and (c) (1 1 1) CBDPs from Si; (b) zero layer and HOLZ (Higher Order Laue Zones) in direct beam and (c) zero layer + 
FOLZ (First Order Laue Zones). 


specimen; these include large voids, precipitates and 
some dislocation structures such as grain boundaries. 

Many processes such as recrystallization, defor- 
mation, recovery, martensitic transformation, etc. are 
dominated by the effects of the specimen surfaces in 
thin samples and the use of thicker foils enables these 
phenomena to be studied as they occur in bulk mate- 
rials. With thicker foils it is possible to construct intri- 
cate stages which enable the specimen to be cooled, 
heated, strained and exposed to various chemical envi- 
ronments while it is being looked through. 

A disadvantage of HVEM is that as the beam voltage 
is raised the energy transferred to the atom by the 
fast electron increases until it becomes sufficient to 
eject the atom from its site. The amount of energy 
transferred from one particle to another in a collision 
depends on the ratio of the two masses (see Chapter 4). 
Because the electron is very light compared with an 
atom, the transfer of energy is very inefficient and the 
electron needs to have several hundred keV before it 
can transmit the 25 eV or so necessary to displace an 
atom. To avoid radiation damage it is necessary to 
keep the beam voltage below the critical displacement 
value which is ~100 kV for Mg and ~1300 kV for 
Au. There is, however, much basic scientific interest 
in radiation damage for technological reasons and a 
HVEM enables the damage processes to be studied 
directly. 

5.4.5 Chemical microanalysis 

5.4.5. 1 Exploitation of characteristic X-rays 
Electron probe microanalysis (EPMA) of bulk sam- 
ples is now a routine technique for obtaining rapid, 
accurate analysis of alloys. A small electron probe 
(~100 nm diameter) is used to generate X-rays from 
a defined area of a polished specimen and the inten- 
sity of the various characteristic X-rays measured using 


either wavelength-dispersive spectrometers (WDS) or 
energy-dispersive spectrometers (EDS). Typically the 
accuracy of the analysis is ±0.1%. One of the lim- 
itations of EPMA of bulk samples is that the vol- 
ume of the sample which contributes to the X-ray 
signal is relatively independent of the size of the 
electron probe, because high-angle elastic scattering 
of electrons within the sample generates X-rays (see 
Figure 5.32). The consequence of this is that the spatial 
resolution of EPMA is no better than ~2 pm. In the 
last few years EDX detectors have been interfaced to 
transmission electron microscopes which are capable 
of operating with an electron probe as small as 2 nm. 
The combination of electron-transparent samples, in 
which high-angle elastic scattering is limited, and a 
small electron probe leads to a significant improvement 
in the potential spatial resolution of X-ray microanal- 
ysis. In addition, interfacing of energy loss spectrom- 
eters has enabled light elements to be detected and 
measured, so that electron microchemical analysis is 
now a powerful tool in the characterization of materi- 
als. With electron beam instrumentation it is required 
to measure (1) the wavelength or energies of emitted 
X-rays (WDX and EDX), (2) the energy losses of the 
fast electrons (EELS), and (3) the energies of emitted 
electrons (AES). Nowadays (1) and (2) can be carried 
out on the modem TEM using special detector systems, 
as shown schematically in Figure 5.33. 

In a WDX spectrometer a crystal of known d- 
spacing is used which diffracts X-rays of a spe- 
cific wavelength, X, at an angle 0, given by the 
Bragg equation, nX = 2d sin 0. Different wavelengths 
are selected by changing 0 and thus to cover the neces- 
sary range of wavelengths, several crystals of different 
d-spacings are used successively in a spectrometer. 
The range of wavelength is 0.1 -2.5 nm and the corre- 
sponding d- spacing for practicable values of 6 , which 
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Figure 5.32 Schematic diagram showing the generation of 
electrons and X-rays within the specimen. 
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Figure 5.33 Schematic diagram of EDX and EELS in TEM. 


lie between ~15° and 65°, is achieved by using crystals 
such as LiF, quartz, mica, etc. In a WDX spectrometer 
the specimen (which is the X-ray source), a bent crys- 
tal of radius 2 r and the detector all lie on the focusing 
circle radius r and different wavelength X-rays are col- 
lected by the detector by setting the crystal at different 
angles, 6. The operation of the spectrometer is very 
time-consuming since only one particular X-ray wave- 
length can be focused on to the detector at any one 
time. 

The resolution of WDX spectrometers is controlled 
by the perfection of the crystal, which influences the 
range of wavelengths over which the Bragg condition 
is satisfied, and by the size of the entrance slit to the X- 
ray detector; taking the resolution (AX) to ~ 0.001 nm 


then X/ A X is about 300 which, for a medium atomic 
weight sample, leads to a peak- background ratio of 
about 250. The crystal spectrometer normally uses a 
proportional counter to detect the X-rays, producing 
an electrical signal, by ionization of the gas in the 
counter, proportional to the X-ray energy, i.e. inversely 
proportional to the wavelength. The window of the 
counter needs to be thin and of low atomic number 
to minimize X-ray absorption. The output pulse from 
the counter is amplified and differentiated to produce a 
short pulse. The time constant of the electrical circuit 
is of the order of 1 ps which leads to possible count 
rates of at least 10 5 /s. 

In recent years EDX detectors have replaced WDX 
detectors on transmission microscopes and are used 
together with WDX detectors on microprobes and on 
SEMs. A schematic diagram of a Si-Li detector is 
shown in Figure 5.34. X-rays enter through the thin 
Be window and produce electron-hole pairs in the 
Si-Li. Each electron-hole pair requires 3.8 eV, at the 
operating temperature of the detector, and the number 
of pairs produced by a photon of energy E v is thus 
£p/3.8. The charge produced by a typical X-ray photon 
is ~ ICE 16 C and this is amplified to give a shaped 
pulse, the height of which is then a measure of the 
energy of the incident X-ray photon. The data are 
stored in a multi-channel analyser. Provided that the 
X-ray photons arrive with a sufficient time interval 
between them, the energy of each incident photon can 
be measured and the output presented as an intensity 
versus energy display. The amplification and pulse 
shaping takes about 50 jis and if a second pulse arrives 
before the preceding pulse is processed, both pulses are 
rejected. This results in significant dead time for count 
rates >4000/s. 

The number of electron-hole pairs generated by an 
X-ray of a given energy is subject to normal statisti- 
cal fluctuations and this, taken together with electronic 
noise, limits the energy resolution of a Si-Li detec- 
tor to about a few hundred eV, which worsens with 
increase in photon energy. The main advantage of 
EDX detectors is that simultaneous collection of the 
whole range of X-rays is possible and the energy char- 
acteristics of all the elements >Z = 1 1 in the Periodic 
Table can be obtained in a matter of seconds. The main 



Figure 5.34 Schematic diagram of Si-Li X-ray detector. 
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disadvantages are the relatively poor resolution, which 
leads to a peak-background ratio of about 50, and the 
limited count rate. 

The variation in efficiency of a Si -Li detector must 
be allowed for when quantifying X-ray analysis. At 
low energies (<1 kV) the X-rays are mostly absorbed 
in the Be window and at high energies (>20 kV), the 
X-rays pass through the detector so that the decreasing 
cross-section for electron-hole pair generation results 
in a reduction in efficiency. The Si -Li detector thus 
has optimum detection efficiency between about 1 and 
20 kV. 

5. 4. 5. 2 Electron microanalysis of thin foils 

There are several simplifications which arise from the 
use of thin foils in microanalysis. The most important 
of these arises from the fact that the average energy 
loss which electrons suffer on passing through a thin 
foil is only about 2%, and this small average loss 
means that the ionization cross-section can be taken 
as a constant. Thus the number of characteristic X- 
ray photons generated from a thin sample is given 
simply by the product of the electron path length and 
the appropriate cross-section Q , i.e. the probability of 
ejecting the electron, and the fluorescent yield go. The 
intensity generated by element A is then given by 

I a = iQcon 

where Q is the cross-section per cm 2 for the particular 
ionization event, go the fluorescent yield, n the number 
of atoms in the excited volume, and i the current inci- 
dent on the specimen. Microanalysis is usually carried 
out under conditions where the current is unknown and 
interpretation of the analysis simply requires that the 
ratio of the X-ray intensities from the various elements 
be obtained. For the simple case of a very thin speci- 
men for which absorption and X-ray fluorescence can 
be neglected, then the measured X-ray intensity from 
element A is given by 

I A a n A Q A co A a A r] A 

and for element B by 

7b oc HbQb^b^b^Ib 

where n,Q, go, a and r\ represent the number of atoms, 
the ionization cross-sections, the fluorescent yields, the 
fraction of the K line (or L and M) which is collected 
and the detector efficiencies, respectively, for elements 
A and B. Thus in the alloy made up of elements A 
and B 

ft a I a Qbcob^b^]b „ 7a 

OC = K AB 

ft B I bQ A^aCIa?) A 7b 

This equation forms the basis for X-ray microanaly- 
sis of thin foils where the constant K AB contains all 
the factors needed to correct for atomic number differ- 
ences, and is known as the Z-correction. Thus from the 
measured intensities, the ratio of the number of atoms 


A to the number of atoms B, i.e. the concentrations of 
A and B in an alloy, can be calculated using the com- 
puted values for Q, go, r], etc. A simple spectrum for 
stoichiometric NiAl is shown in Figure 5.35 and the 
values of 7^ and 7^, obtained after stripping the back- 
ground, are given in Table 5.2 together with the final 
analysis. The absolute accuracy of any X-ray analysis 
depends either on the accuracy and the constants Q, go, 
etc. or on the standards used to calibrate the measured 
intensities. 

If the foil is too thick then an absorption 
correction (A) may have to be made to the measured 
intensities, since in traversing a given path length 
to emerge from the surface of the specimen, the X- 
rays of different energies will be absorbed differently. 
This correction involves a knowledge of the specimen 
thickness which has to be determined by one of various 
techniques but usually from CBDPs. Occasionally 
a fluorescence (F) correction is also needed since 
element Z + 2. This ‘nostandards ’ Z(AF) analysis can 
given an overall accuracy of ~2% and can be carried 
out on-line with laboratory computers. 

5.4.6 Electron energy loss spectroscopy 
(EELS) 

A disadvantage of EDX is that the X-rays from the 
light elements are absorbed in the detector window. 
Windowless detectors can be used but have some 
disadvantages, such as the overlapping of spectrum 
lines, which have led to the development of EELS. 

EELS is possible only on transmission specimens, 
and so electron spectrometers have been interfaced 
to TEMs to collect all the transmitted electrons lying 
within a cone of width a. The intensity of the various 
electrons, i.e. those transmitted without loss of energy 
and those that have been inelastically scattered and lost 
energy, is then obtained by dispersing the electrons 
with a magnetic prism which separates spatially the 
electrons of different energies. 

A typical EELS spectrum illustrated in Figure 5.36 
shows three distinct regions. The zero loss peak is 
made up from those electrons which have (1) not 
been scattered by the specimen, (2) suffered photon 
scattering (~l/40 eV) and (3) elastically scattered. 
The energy width of the zero loss peak is caused 
by the energy spread of the electron source (up to 
~2 eV for a thermionic W filament) and the energy 
resolution of the spectrometer (typically a few eV). 
The second region of the spectrum extends up to about 
50 eV loss and is associated with plasmon excitations 
corresponding to electrons which have suffered one, 
two, or more plasmon interactions. Since the typical 
mean free path for the generation of a plasmon is 
about 50 nm, many electrons suffer single-plasmon 
losses and only in specimens which are too thick for 
electron loss analysis will there be a significant third 
plasmon peak. The relative size of the plasmon loss 
peak and the zero loss peak can also be used to measure 
the foil thickness. Thus the ratio of the probability of 
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exciting a plasmon loss, Pi , to not exciting a plasmon, 
P 0 , is given by Pi/Po = t/L , where t is the thickness, 
L the mean free path for plasmon excitation, and Pi 
and P 0 are given by the relative intensities of the 
zero loss and the first plasmon peak. If the second 
plasmon peak is a significant fraction of the first peak 
this indicates that the specimen will be too thick for 
accurate microanalysis. 

The third region is made up of a continuous back- 
ground on which the characteristic ionization losses 
are superimposed. Qualitative elemental analysis can 
be carried out simply by measuring the energy of the 
edges and comparing them with tabulated energies. 
The actual shape of the edge can also help to define 
the chemical state of the element. Quantitative analysis 
requires the measurement of the ratios of the intensities 
of the electrons from elements A and B which have 
suffered ionization losses. In principle, this allows the 
ratio of the number of A atoms, N a, and B atoms, N B , 
to be obtained simply from the appropriate ionization 
cross-sections, Qk- Thus the number of A atoms will 
be given by 

Na = (i /Q&ul/io] 

and the number of B atoms by a similar expression, so 
that 

n a /n b = i^q b k /iIq^ 

where /£ is the measured intensity of the K edge for 
element A, similarly for /| and I 0 is the measured 
intensity of the zero loss peak. This expression is 
similar to the thin foil EDX equation. 

To obtain / K the background has to be removed 
so that only loss electrons remain. Because of the 
presence of other edges there is a maximum energy 
range over which / K can be measured which is 
about 50-100 eV. The value of 2 k must therefore be 
replaced by g K (A) which is a partial cross-section cal- 
culated for atomic transition within an energy range A 
of the ionization threshold. Furthermore, only the loss 
electrons arising from an angular range of scatter a at 
the specimen are collected by the spectrometer so that 
a double partial cross-section Q( A, a) is appropriate. 
Thus analysis of a binary alloy is carried out using the 
equation 

Na = gf(A, oQ/£( A, a) 

N b 2k(A, a)/I(A, a) 

Values of Q( A, a) may be calculated from data in the 
literature for the specific value of ionization edge, A, a 
and incident accelerating voltage, but give an analysis 
accurate to only about 5%; a greater accuracy might 
be possible if standards are used. 

5.4.7 Auger electron spectroscopy (AES) 

Auger electrons originate from a surface layer a few 
atoms thick and therefore AES is a technique for study- 
ing the composition of the surface of a solid. It is 


obviously an important method for studying oxidation, 
catalysis and other surface chemical reactions, but has 
also been used successfully to determine the chem- 
istry of fractured interfaces and grain boundaries (e.g. 
temper embrittlement of steels). 

The basic instrumentation involves a focusable elec- 
tron gun, an electron analyser and a sample support 
and manipulation system, all in an ultra-high-vacuum 
environment to minimize adsorption of gases onto the 
surface during analysis. Two types of analyser are in 
use, a cylindrical mirror analyser (CMA) and a hemi- 
spherical analyser (HSA), both of which are of the 
energy-dispersive type as for EELS, with the differ- 
ence that the electron energies are much lower, and 
electrostatic rather than magnetic Tenses’ are used to 
separate out the electrons of different energies. 

In the normal distribution the Auger electron peaks 
appear small on a large and often sloping background, 
which gives problems in detecting weak peaks since 
amplification enlarges the background slope as well 
as the peak. It is therefore customary to differentiate 
the spectrum so that the Auger peaks are emphasized 
as doublet peaks with a positive and negative displace- 
ment against a nearly flat background. This is achieved 
by electronic differentiation by applying a small a.c. 
signal of a particular frequency in the detected signal. 
Chemical analysis through the outer surface layers can 
be carried out by depth profiling with an argon ion gun. 

5.5 Observation of defects 

5.5.1 Etch pitting 

Since dislocations are regions of high energy, their 
presence can be revealed by the use of an etchant 
which chemically attacks such sites preferentially. This 
method has been applied successfully in studying 
metals, alloys and compounds, and there are many 
fine examples in existence of etch-pit patterns show- 
ing small-angle boundaries and pile-ups. Figure 5.37a 
shows an etch-pit pattern from an array of piled-up dis- 
locations in a zinc crystal. The dislocations are much 
closer together at the head of the pile-up, and an anal- 
ysis of the array, made by Gilman, shows that their 
spacing is in reasonable agreement with the theory of 
Eshelby, Frank and Nabarro, who have shown that the 
number of dislocations n that can be packed into a 
length L of slip plane is n = 2 Lt/ [± b, where r is the 
applied stress. The main disadvantage of the technique 
is its inability to reveal networks or other arrangements 
in the interior of the crystal, although some informa- 
tion can be obtained by taking sections through the 
crystal. Its use is also limited to materials with low 
dislocation contents (<10 4 mm -2 ) because of the lim- 
ited resolution. In recent years it has been successfully 
used to determine the velocity v of dislocations as a 
function of temperature and stress by measuring the 
distance travelled by a dislocation after the application 
of a stress for a known time (see Chapter 7). 



The characterization of materials 155 




Figure 5.37 Direct observation of dislocations, (a) Pile-up in a zinc single crystal (after Gilman, 1956, p. 1000). 
(b) Frank-Read source in silicon (after Dash, 1957; courtesy of John Wiley and Sons). 


5.5.2 Dislocation decoration 

It is well-known that there is a tendency for solute 
atoms to segregate to grain boundaries and, since these 
may be considered as made up of dislocations, it is 
clear that particular arrangements of dislocations and 
sub-boundaries can be revealed by preferential precip- 
itation. Most of the studies in metals have been carried 
out on aluminium- copper alloys, to reveal the dislo- 
cations at the surface, but recently several decoration 
techniques have been devised to reveal internal struc- 
tures. The original experiments were made by Hedges 
and Mitchell in which they made visible the disloca- 
tions in AgBr crystals with photographic silver. After 
a critical annealing treatment and exposure to light, the 
colloidal silver separates along dislocation lines. The 
technique has since been extended to other halides, and 
to silicon where the decoration is produced by diffus- 
ing copper into the crystal at 900° C so that on cooling 
the crystal to room temperature, the copper precipi- 
tates. When the silicon crystal is examined optically, 
using infrared illumination, the dislocation-free areas 
transmit the infrared radiation, but the dislocations dec- 
orated with copper are opaque. A fine example of 
dislocations observed using this technique is shown 
in Figure 5.37b. 


The technique of dislocation decoration has the 
advantage of revealing internal dislocation networks 
but, when used to study the effect of cold- work on the 
dislocation arrangement, suffers the disadvantage of 
requiring some high-temperature heat-treatment dur- 
ing which the dislocation configuration may become 
modified. 

5.5.3 Dislocation strain contrast in TEM 

The most notable advance in the direct observation of 
dislocations in materials has been made by the applica- 
tion of transmission techniques to thin specimens. The 
technique has been used widely because the disloca- 
tion arrangements inside the specimen can be studied. 
It is possible, therefore, to investigate the effects of 
plastic deformation, irradiation, heat- treatment, etc. on 
the dislocation distribution and to record the move- 
ment of dislocations by taking cine-films of the images 
on the fluorescent screen of the electron microscope. 
One disadvantage of the technique is that the materi- 
als have to be thinned before examination and, because 
the surface-to-volume ratio of the resultant specimen 
is high, it is possible that some rearrangement of dis- 
locations may occur. 

A theory of image contrast has been developed 
which agrees well with experimental observations. The 
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basic idea is that the presence of a defect in the lattice 
causes displacements of the atoms from their position 
in the perfect crystal and these lead to phase changes 
in the electron waves scattered by the atoms so that 
the amplitude diffracted by a crystal is altered. The 
image seen in the microscope represents the electron 
intensity distribution at the lower surface of the speci- 
men. This intensity distribution has been calculated by 
a dynamical theory (see Section 5.5.7) which considers 
the coupling between the diffracted and direct beams 
but it is possible to obtain an explanation of many 
observed contrast effects using a simpler (kinematical) 
theory in which the interactions between the transmit- 
ted and scattered waves are neglected. Thus if an elec- 
tron wave, represented by the function exp (: Imk^.r ) 
where k 0 is the wave vector of magnitude 1/A., is inci- 
dent on an atom at position r there will be an elastically 
scattered wave exp (2mk\.r) with a phase difference 
equal to 2nr{k\ — ko) when k\ is the wave vector of the 
diffracted wave. If the crystal is not oriented exactly 
at the Bragg angle the reciprocal lattice point will be 
either inside or outside the reflecting sphere and the 
phase difference is then 2n r(g + s) where g is the 
reciprocal lattice vector of the lattice plane giving rise 
to reflection and s is the vector indicating the devia- 
tion of the reciprocal lattice point from the reflection 
sphere (see Figure 5.39). To obtain the total scattered 
amplitude from a crystal it is necessary to sum all the 
scattered amplitudes from all the atoms in the crys- 
tal, i.e. take account of all the different path lengths 
for rays scattered by different atoms. Since most of 
the intensity is concentrated near the reciprocal lattice 
point it is only necessary to calculate the amplitude 
diffracted by a column of crystal in the direction of 
the diffracted by a column of crystal in the direction 
of the diffracted beam and not the whole crystal, as 
shown in Figure 5.38. The amplitude of the diffracted 


beam 0 g for an incident amplitude 0 O = 1, is then 
(p g = (7Ti/§ g ) / exp [— 27ri(g + s).r]dr 

Jo 

and since r.s is small and g.r is an integer this reduces 
to 


= (Jr i/?g) [ 

Jo 


exp [— 27iis.r]dr 


= (7ri/§ g ) / exp [— 27iisz]dz 

Jo 

where z is taken along the column. The intensity from 
such a column is 

|^g| 2 = I g = br 2 /£;](sin 2 7rts/(7rs) 2 ) 


from which it is evident that the diffracted intensity 
oscillates with depth z in the crystal with a periodicity 
equal to 1 /s. The maximum wavelength of this oscil- 
lation is known as the extinction 1 distance § g since the 
diffracted intensity is essentially extinguished at such 
positions in the crystal. This sinusoidal variation of 
intensity gives rise to fringes in the electron-optical 
image of boundaries and defects inclined to the foil 
surface, e.g. a stacking fault on an inclined plane is 
generally visible on an electron micrograph as a set 
of parallel fringes running parallel to the intersec- 
tion of the fault plane with the plane of the foil (see 
Figure 5.43). 

In an imperfect crystal, atoms are displaced from 
their true lattice positions. Consequently, if an atom 
at r n is displaced by a vector R, the amplitude of 
the wave diffracted by the atom is multiplied by 


! £ g = jtV cos 6fkF where V is the volume of the unit cell, 
0 the Bragg angle and F the structure factor. 


( a ) incident beam (b) 



Direct Diffracted 
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Figure 5.38 (a) Column approximation used to calculate the amplitude of the diffracted beam </> g on the bottom surface of the 
crystal. The dislocation is at a depth y and a distance x from the column, (b) Variation of intensity with depth in a crystal. 
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an additional phase factor exp [2m{k\ — Then, 

since (k\ — k 0 ) = g + s the resultant amplitude is 

0 g = (7Ti/§ g ) [ exp [— 2jri(g + s).(r + R)]dr 
Jo 


If we neglect s.R which is small in comparison with 
g. R, and g.r which gives an integer, then in terms of 
the column approximation 


4>g = 



exp (— 27iisz) exp (— 2mg.R)dz 


The amplitude, and hence the intensity, therefore 
may differ from that scattered by a perfect crystal, 
depending on whether the phase factor a = 2ng.R is 
finite or not, and image contrast is obtained when 
g.R ^ 0. 


5.5.4 Contrast from crystals 

In general, crystals observed in the microscope appear 
light because of the good transmission of electrons. In 
detail, however, the foils are usually slightly buckled 
so that the orientation of the crystal relative to the 
electron beam varies from place to place, and if one 
part of the crystal is oriented at the Bragg angle, strong 
diffraction occurs. Such a local area of the crystal 
then appears dark under bright-field illuminations, and 
is known as a bend or extinction contour. If the 
specimen is tilted while under observation, the angular 
conditions for strong Bragg diffraction are altered, and 
the extinction contours, which appear as thick dark 
bands, can be made to move across the specimen. To 
interpret micrographs correctly, it is essential to know 
the correct sense of both g and s. The g- vector is the 
line joining the origin of the diffraction pattern to the 
strong diffraction spot and it is essential that its sense is 
correct with respect to the micrograph, i.e. to allow for 
any image inversion or rotation by the electron optics. 
The sign of s can be determined from the position of 
the Kikuchi lines with respect to the diffraction spots, 
as discussed in Section 5.4.1. 


5.5.5 Imaging of dislocations 

Image contrast from imperfections arises from the 
additional phase factor a = 2jrg.R in the equation for 
the diffraction of electrons by crystals. In the case of 
dislocations the displacement vector R is essentially 
equal to b, the Burgers vector of the dislocation, since 
atoms near the core of the dislocation are displaced 
parallel to b. In physical terms, it is easily seen 
that if a crystal, oriented off the Bragg condition, 
i.e. s ^ 0, contains a dislocation then on one side 
of the dislocation core the lattice planes are tilted 
into the reflecting position, and on the other side of 
the dislocation the crystal is tilted away from the 
reflecting position. On the side of the dislocation 
in the reflecting position the transmitted intensity, 
i.e. passing through the objective aperture, will be 


less and hence the dislocation will appear as a line 
in dark contrast. It follows that the image of the 
dislocation will lie slightly to one or other side of 
the dislocation core, depending on the sign of (g.b)s. 
This is shown in Figure 5.39 for the case where the 
crystal is oriented in such a way that the incident beam 
makes an angle greater than the Bragg angle with 
the reflecting planes, i.e. s > 0. The image occurs on 
that side of the dislocation where the lattice rotation 
brings the crystal into the Bragg position, i.e. rotates 
the reciprocal lattice point onto the reflection sphere. 
Clearly, if the diffracting conditions change, i.e. g or 
s change sign, then the image will be displaced to the 
other side of the dislocation core. 

The phase angle introduced by a lattice defect is 
zero when g.R = 0, and hence there is no contrast, i.e. 
the defect is invisible when this condition is satisfied. 
Since the scalar product g.R is equal to gR cos 0, where 
6 is the angle between g and R, then g.R = 0 when the 
displacement vector R is normal to g, i.e. parallel to the 
reflecting plane producing the image. If we think of the 
lattice planes which reflect the electrons as mirrors, it is 
easy to understand that no contrast results when g.R = 
0, because the displacement vector R merely moves the 
reflecting planes parallel to themselves without altering 
the intensity scattered from them. Only displacements 
which have a component perpendicular to the reflecting 
plane, i.e. tilting the planes, will produce contrast. 

A screw dislocation only produces atomic displace- 
ments in the direction of its Burgers vector, and hence 
because R = b such a dislocation will be completely 
‘invisible’ when b lies in the reflecting plane producing 
the image. A pure edge dislocation, however, pro- 
duces some minor atomic displacements perpendicular 
to b , as discussed in Chapter 4, and the displacements 
give rise to a slight curvature of the lattice planes. An 
edge dislocation is therefore not completely invisible 
when b lies in the reflecting planes, but usually shows 
some evidence of faint residual contrast. In general, 



Figure 5.39 Schematic diagram showing the dependence of 
the dislocation image position on diffraction conditions. 
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(a) (b) 

Figure 5.40 (a) Application of the g.b = 0 criterion. The effect of changing the diffraction condition (see diffraction pattern 
inserts ) makes the long helical dislocation B in (a) disappear in (b) (after Hirsch, Howie and Whelan, 1960; courtesy of the 
Royal Society). 


however, a dislocation goes out of contrast when the 
reflecting plane operating contains its Burgers vector, 
and this fact is commonly used to determine the Burg- 
ers vector. To establish b uniquely, it is necessary to 
tilt the foil so that the dislocation disappears on at least 
two different reflections. The Burgers vector must then 
be parallel to the direction which is common to these 
two reflecting planes. The magnitude of b is usually 
the repeat distance in this direction. 

The use of the g.b = 0 criterion is illustrated in 
Figure 5.40. The helices shown in this micrograph 
have formed by the condensation of vacancies on to 
screw dislocations having their Burgers vector b par- 
allel to the axis of the helix. Comparison of the two 
pictures in (a) and (b) shows that the effect of tilt- 
ing the specimen, and hence changing the reflecting 
plane, is to make the long helix B in (a) disappear 
in (b). In detail, the foil has a [001] orientation and 
the long screws lying in this plane are 1/2 [1 1 0] 
and 1/2[1 10]. In Figure 5.40a the insert shows the 
020 reflection is operating and so g.b ^ 0 for either 
A or B, but in Figure 5.40b the insert shows that 
the 2 20 reflection is operating and the dislocation B 
is invisible since its Burgers vector b is normal 
to the g- vector, i.e. g.b = 2 20.1/2[1 10] = (^ x 1 x 
2) + (^ x 1 x 2) + 0 = 0 for the dislocation B, and is 
therefore invisible. 

5.5.6 Imaging of stacking faults 

Contrast at a stacking fault arises because such a 
defect displaces the reflecting planes relative to each 
other, above and below the fault plane, as illustrated in 


Figure 5.41a. In general, the contrast from a stacking 
fault will not be uniformly bright or dark as would 
be the case if it were parallel to the foil surface, 
but in the form of interference fringes running par- 
allel to the intersection of the foil surface with the 
plane containing the fault. These appear because the 
diffracted intensity oscillates with depth in the crystal 
as discussed. The stacking fault displacement vector 
R, defined as the shear parallel to the fault of the por- 
tion of crystal below the fault relative to that above the 
fault which is as fixed, gives rise to a phase difference 
a = 27rg.R in the electron waves diffracted from either 
side of the fault. It then follows that stacking-fault con- 
trast is absent with reflections for which a = 2n, i.e. 
for which g.R = n. This is equivalent to the g.b = 0 
criterion for dislocations and can be used to deduce R. 

The invisibility of stacking fault contrast when 
g.R = 0 is exactly analogous to that of a dislocation 
when g.b = 0, namely that the displacement vector is 
parallel to the reflecting planes. The invisibility when 
g . R = 1, 2, 3, . . . occurs because in these cases the 
vector R moves the imaging reflecting planes normal 
to themselves by a distance equal to a multiple of the 
spacing between the planes. From Figure 5.41b it can 
be seen that for this condition the reflecting planes are 
once again in register on either side of the fault, and, as 
a consequence, there is no interference between waves 
from the crystal above and below the fault. 

5.5.7 Application of dynamical theory 

The kinematical theory, although very useful, has limi- 
tations. The equations are only valid when the crystal is 
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Figure 5.41 Schematic diagram showing (a) displacement of reflecting planes by a stacking fault and (b) the condition 
forg.R = n when the fault would be invisible. 


oriented far from the exact Bragg condition, i.e. when 
s is large. The theory is also only strictly applicable 
for foils whose thickness is less than about half an 
extinction distance (|§ g ) and no account is taken of 
absorption. The dynamical theory has been developed 
to overcome these limitations. 

The object of the dynamical theory is to take into 
account the interactions between the diffracted and 
transmitted waves. Again only two beams are consid- 
ered, i.e. the transmitted and one diffracted beam, and 
experimentally it is usual to orient the specimen in 
a double-tilt stage so that only one strong diffracted 
beam is excited. The electron wave function is then 
considered to be made up of two plane waves -an inci- 
dent or transmitted wave and a reflected or diffracted 
wave 



t % 9 


Figure 5.42 Computed intensity profiles about the foil 
centre for a stacking fault with a = +2n/3. The full curve 
is the B.F. and the broken curve the D.F. image (from 
Hirsch, Howie et al., 1965). 


0(r) = 0 o exp(27riko.r) + 0 g exp(2:/rik 1 .r) 


The two waves can be considered to propagate together 
down a column through the crystal since the Bragg 
angle is small. Moreover, the amplitudes 0 O and f> g 
of the two waves are continually changing with depth 
z in the column because of the reflection of electrons 
from one wave to another. This is described by a pair 
of coupled first-order differential equations linking the 
wave amplitudes 0 o and f> g . Displacement of an atom 
R causes a phase change a = 2jrg.K in the scattered 
wave, as before, and the two differential equations 
describing the dynamical equilibrium between incident 
and diffracted waves 


#0 

dz 


jzi 


4 > g 


dz 


— 00 + 27T0 g 
5g 


S + g. 


dR 

dz 


These describe the change in reflected amplitude 0 g 
because electrons are reflected from the transmitted 
wave (this change is proportional to 0 O , the transmitted 
wave amplitude, and contains the phase factor) and the 
reflection in the reverse direction. 

These equations show that the effect of a displace- 
ment R is to modify s locally, by an amount propor- 
tional to the derivative of the displacement, i.e. dR/dz, 
which is the variation of displacement with depth z in 
the crystal. This was noted in the kinematical theory 
where dR/dz is equivalent to a local tilt of the lattice 
planes. The variation of the intensities |0 O | 2 and |0 g | 2 


for different positions of the column in the crystal, rel- 
ative to the defect, then gives the bright and dark-field 
images respectively. Figure 5.42 shows the bright- and 
dark-field intensity profiles from a stacking fault on an 
inclined plane, in full and broken lines, respectively. 
A wide variety of defects have been computed, some 
of which are summarized below: 

1. Dislocations In elastically isotropic crystals, perfect 
screw dislocations show no contrast if the condition 
g.b = 0 is satisfied. Similarly, an edge dislocation 
will be invisible if g.b = 0 and if g.b x u = 0 
where u is a unit vector along the dislocation line 
and b x u describes the secondary displacements 
associated with an edge dislocation normal to the 
dislocation line and b. The computations also show 
that for mixed dislocations and edge dislocations 
for which g.b x u<0.64 the contrast produced will 
be so weak as to render the dislocation virtually 
invisible. At higher values of g.b x u some contrast 
is expected. In addition, when the crystal becomes 
significantly anisotropic residual contrast can be 
observed even for g.b = 0. 

The image of a dislocation lies to one side of 
the core, the side being determined by ( g.b)s . Thus 
the image of a dislocation loop lies totally outside 
the core when (using the appropriate convention) 
(g.b)s is positive and inside when ( g.b)s is neg- 
ative. Vacancy and interstitial loops can thus be 
distinguished by examining their size after chang- 
ing from +g to —g, since these loops differ only in 
the sign of b. 
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2. Partial dislocations Partials for which g.b = 

(e.g. partial a/6[l 1 2] on (1 1 1) observed with 200 
reflection) will be invisible at both small and large 
deviations from the Bragg condition. Partials exam- 
ined under conditions for which g.b = ± | (i.e. par- 
tial a/6 [2 1 1] on (111) with 200 reflection) are 
visible except at large deviations from the Bragg 
condition. A partial dislocation lying above a simi- 
lar stacking fault is visible for g.b = and invis- 
ible for g.b = ±|. 

3. Stacking faults For stacking faults running from top 
to bottom of the foil, the bright-held image is sym- 
metrical about the centre, whereas the dark-held 
image is asymmetrical (see Figure 5.42). The top 
of the foil can thus be determined from the non- 
complementary nature of the fringes by comparing 
bright- and dark-held images. Moreover, the inten- 
sity of the hrst fringe is determined by the sign 


of the phase-factor a , such that when a is positive 
the hrst fringe is bright (corresponding to a higher 
transmitted intensity) and vice versa on a positive 
photographic print. 

It is thus possible to distinguish between intrinsic and 
extrinsic faults and an example is shown in Figure 5.43 
for an intrinsic fault on (1 1 1). The foil orientation is 
[110] and the non-complementary nature of the hrst 
fringe between B.F. and D.F. indicates the top of the 
foil, marked T. Furthermore, from the B.F. images the 
hrst fringe is bright with 111, and dark with 111 and 

III. 

5.5.8 Weak-beam microscopy 

One of the limiting factors in the analysis of defects is 
the fact that dislocation images have widths of § g /3, 
i.e. typically >10.0 nm. It therefore follows that dis- 
locations closer together than about 20.0 nm are not 
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Figure 5.43 Bright-field and dark-field micrographs of an intrinsic stacking fault in a copper-aluminium alloy; the operating 
diffraction vectors are (a) 1 1 1 (b) 1 1 1 and (c) 111 (after Howie and Valdre, 1963; courtesy of Taylor and Francis ). 
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(b) 


Figure 5.44 Symmetrical node in Fe-21Cr-14Ni stainless steel with y = 18 ±4 mJ/m 2 , (a) B.F. with g = 111 (b) weak 
beam with g(5g). 


generally resolved. With normal imaging techniques, 
the detail that can be observed is limited to a value 
about fifty to a hundred times greater than the reso- 
lution of the microscope. This limitation can be over- 
come by application of the weak-beam technique in 
which crystals are imaged in dark-field using a very 
large deviation parameter s. Under these conditions 
the background intensity is very low so that weak 
images are seen in very high contrast and the dis- 
location images are narrow, (~1.5 nm) as shown in 
Figure 5.44. At the large value of s used in weak-beam, 
the transfer of energy from the direct to the diffracted 
beam is very small, i.e. the crystal is a long way from 
the Bragg condition and there is negligible diffraction. 
Moreover, it is only very near the core of the dis- 
location that the crystal planes are sufficiently bent 
to cause the Bragg condition to be locally satisfied, 
i.e. g.(dR/dz) be large enough to satisfy the condi- 
tion [s + g.(dR/dz)] = 0. Therefore diffraction takes 
place from only a small volume near the centre of the 
dislocation, giving rise to narrow images. The abso- 
lute intensity of these images is, however, very small 
even though the signal-to-background ratio is high and 
hence long exposures are necessary to record them. 

5.6 Specialized bombardment 
techniques 

5.6.1 Neutron diffraction 

The advent of nuclear reactors stimulated the applica- 
tion of neutron diffraction to those problems of mate- 
rials science which could not be solved satisfactorily 
by other diffraction techniques. In a conventional pile 
the fast neutrons produced by fission are slowed down 


by repeated collisions with a ‘moderator’ of graphite 
or heavy water until they are slow enough to produce 
further fission. If a collimator is inserted into the pile, 
some of these slow neutrons 1 will emerge from it in 
the form of a beam, and the equivalent wavelength X 
of this neutron beam of energy E in electron-volts is 
given by X = 0.008 l/E. The equilibrium temperature 
in a pile is usually in the range 0-100°C, which cor- 
responds to a peak energy of several hundredths of 
an electron-volt. The corresponding wavelength of the 
neutron beam is about 0.15 nm and since this is very 
similar to the wavelength of X-rays it is to be expected 
that thermal neutrons will be diffracted by crystals. 

The properties of X-ray and neutron beams differ 
in many respects. The distribution of energy among 
the neutrons in the beam approximately follows the 
Maxwellian curve appropriate to the equilibrium tem- 
perature and, consequently, there is nothing which cor- 
responds to characteristic radiation. The neutron beam 
is analogous to a beam of ‘white’ X-rays, and as a 
result it has to be monochromatized before it can be 
used in neutron crystallography. Then, because only 
about 1 in 10 3 of the neutrons in the originally weak 
collimated beam are reflected from the monochroma- 
tor, it is necessary to employ very wide beams several 
inches in cross-section to achieve a sufficiently high 
counting rate on the boron trifluoride counter detec- 
tor (photographic detection is possible but not gen- 
erally useful). In consequence, neutron spectrometers, 
although similar in principle to X-ray diffractometers, 
have to be constructed on a massive scale. 

1 These may be called ‘thermal’ neutrons because they are 
in thermal equilibrium with their surroundings. 
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Neutron beams do, however, have advantages over 
X-rays or electrons, and one of these is the extremely 
low absorption of thermal neutrons by most elements. 
Table 5.3 shows that even in the most highly absorbent 
elements (e.g. lithium, boron, cadmium and gadolin- 
ium) the mass absorption coefficients are only of the 
same order as those for most elements for a comparable 
X-ray wavelength, and for other elements the neutron 
absorption is very much less indeed. This penetrative 
property of the neutron beam presents a wide scope 
for neutron crystallography, since the whole body of a 
specimen may be examined and not merely its surface. 
Problems concerned with preferred orientation, resid- 
ual stresses, cavitation and structural defects are but 
a few of the possible applications, some of which are 
discussed more fully later. 

Another difference concerns the intensity of scatter- 
ing per atom, / a . For X-rays, where the scattering is by 
electrons, the intensity / a increases with atomic num- 
ber and is proportional to the square of the atomic-form 
factor. For neutrons, where the scattering is chiefly by 
the nucleus, / a appears to be quite unpredictable. The 
scattering power per atom varies not only apparently 
at random from atom to atom, but also from isotope 
to isotope of the same atom. Moreover, the nuclear 
component to the scattering does not decrease with 
increasing angle, as it does with X-rays, because the 
nucleus which causes the scattering is about 10“ 12 mm 
in size compared with 10 -7 mm, which is the size of 
the electron cloud that scatters X-rays. Table 5.4 gives 
some of the scattering amplitudes for X-rays and ther- 
mal neutrons. 

The fundamental difference in the origin of scat- 
tering between X-rays and neutrons affords a method 
of studying structures, such as hydrides and carbides, 
which contain both heavy and light atoms. When X- 
rays are used, the weak intensity contributions of the 
light atoms are swamped by those from the heavy 
atoms, but when neutrons are used, the scattering 
power of all atoms is roughly of the same order. 
Similarly, structures made up of atoms whose atomic 
numbers are nearly the same (e.g. iron and cobalt, or 
copper and zinc), can be studied more easily by using 


Table 5.3 X-ray and neutron mass absorption coefficients 


Element 

At. no. 

X-rays 

(X = 0 .19 nm) 

Neutrons 
(X = 0.18 nm 

Li 

3 

1.5 

5.8 

B 

5 

5.8 

38.4 

C 

6 

10.7 

0.002 

A1 

13 

92.8 

0.005 

Fe 

26 

72.8 

0.026 

Cu 

29 

98.8 

0.03 

Ag 

47 

402 

0.3 

Cd 

48 

417 

13.0 

Gd 

61 

199 

183.0 

Au 

79 

390 

0.29 

Pb 

82 

429 

0.0006 


Table 5.4 Scattering amplitudes for X-rays and thermal 
neutrons 


Element 

At. no. 

Scattering amplitudes 


X-rays for 
sin 0/X = 0 .5 

xlO~ 12 

Neutrons* 

xlO~ 12 


H 

1 

0.02 


-0.4 

Li 

3 

0.28 

Li 6 

0.7 




Li 7 

-0.25 

C 

6 

0.48 


0.64 

N 

7 

0.54 


0.85 

O 

8 

0.62 


0.58 

At 

13 

1.55 


0.35 

Ti 

22 

2.68 


-0.38 

Fe 

26 

3.27 

Fe 56 

1.0 




Fe 57 

0.23 

Co 

27 

3.42 


0.28 

Cu 

29 

3.75 


0.76 

Zn 

30 

3.92 


0.59 

Ag 

47 

6.71 

Ag 107 

0.83 




Ag i09 

0.43 

Au 

79 

12.37 


0.75 


*The negative sign indicates that the scattered and incident 
waves are in phase for certain isotopes and hence for certain 
elements. Usually the scattered wave from an atom is 180° 
out of phase with the incident wave. 

neutrons. This aspect is discussed later in relation to 
the behaviour of ordered alloy phases. 

The major contribution to the scattering power arises 
from the nuclear component, but there is also an elec- 
tronic (magnetic spin) component to the scattering. 
This arises from the interaction between the mag- 
netic moment of the neutron and any resultant mag- 
netic moment which the atom might possess. As a 
result, the neutron diffraction pattern from paramag- 
netic materials, where the atomic moments are ran- 
domly directed (see Chapter 6), shows a broad diffuse 
background, due to incoherent (magnetic) scattering, 
superimposed on the sharp peaks which arise from 
coherent (nuclear) scattering. In ferromagnetic metals 
the atomic moments are in parallel alignment through- 
out a domain, so that this cause of incoherent scattering 
is absent. In some materials (e.g. NiO or FeO) an 
alignment of the spins takes place, but in this case 
the magnetization directions of neighbouring pairs of 
atoms in the structure are opposed and, in consequence, 
cancel each other out. For these materials, termed 
anti-ferromagnetic, there is no net spontaneous mag- 
netization and neutron diffraction is a necessary and 
important tool for investigating their behaviour (see 
Chapter 6). 

5.6.2 Synchrotron radiation studies 

Very large electrical machines known as synchrotron 
radiation sources (SRS) provide a unique source of 
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Figure 5.45 (a) Layout of SRS, Daresbury, and (b) wavelength spectrum of synchrotron radiation (after Barnes, 1990, 
pp. 708-715; by permission of the Institute of Metals). 


electromagnetic radiation for materials characterisa- 
tion. 1 Electrons from a hot cathode are accelerated 
in three stages by a linear accelerator (Linac), a 
booster synchrotron and an evacuated storage ring 
(Figure 5.45a). As bunches of electrons travel around 
the periphery of the storage ring they attain ener- 
gies of up to 2 GeV and velocities approaching that 
of light. At these relativistic velocities, electron mass 
becomes 4000 times greater than the rest mass. Dipole 
and quadrupole magnets constrain the bunches into 
an approximately circular orbit and, by accelerating 
them centripetally, cause electromagnetic radiation to 
be produced. The spectrum of this synchrotron radi- 
ation is very wide, extending from short-wavelength 
(‘hard’) X-rays to the infrared range (Figure 5.45b). A 
wiggler magnet produces a strong (5 tesla) field and 
can extend the spectrum to even shorter wavelengths. 
Compared with more orthodox sources of electromag- 
netic radiation, the synchrotron offers the advantages 
of very high intensity, short wavelengths, precise col- 
limation of the beam and a smooth, continuous spec- 
trum. The high radiation intensity permits exposure 
times that are often several orders of magnitude shorter 
than those for comparable laboratory methods. The 
risk of beam damage to specimens by the flashes of 
radiation is accordingly lessened. Specimens of met- 
als, ceramics, polymers, semiconductors, catalysts, etc. 

1980, the world’s first totally radiation-dedicated SRS 
came into operation at Daresbury, England. Electrons are 
‘stored’ in the main ring for 10-20 h, traversing its 96 m 
periphery more than 3 x 10 6 times per second. 


are placed in independent experimental stations located 
around the periphery of the ring chamber and irradi- 
ated in order to produce spectroscopic, diffraction or 
imaging effects. 

In the technique known as extended X-ray absorp- 
tion fine-structure spectroscopy (EXAFS) attention is 
directed to the small discontinuities on the higher- 
energy flank beyond each vertical, characteristic ‘edge’ 
which appears in a plot of mass absorption ver- 
sus X-ray wavelength. These ‘finestructure’ (FS) fea- 
tures derive from interference effects between electron 
waves from excited atoms and waves back- scattered 
from surrounding atoms. Mathematical treatment 
(using a Fourier transform) of the EXAFS spectra 
yields a radial distribution plot of surrounding atomic 
density versus distance from the excited atom. By 
selecting the ‘edge’ for a particular type of atom/ion 
and studying its fine structure, it is thus possible to 
obtain information on its local environment and coor- 
dination. This type of information is of great value 
in structural studies of materials, such as glasses, 
which only exhibit short-range order. For instance, the 
EXAFS technique has demonstrated that the network 
structure of Si0 2 -Na 2 0-Ca0 glass is threaded by per- 
colation channels of modifier (sodium) cations. 

5.6.3 Secondary ion mass spectrometry 
(SIMS) 

This important and rapidly-developing technique, 
which enables material surfaces to be analysed with 
great chemical sensitivity and excellent resolution in 
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depth, is based upon the well-known phenomenon of 
sputtering. The target surface is bombarded with a 
focused beam of primary ions that has been accelerated 
with a potential of 1-30 kV within a high-vacuum 
chamber (1CT 5 -10 _10 torr). These ions generate a 
series of collision cascades in a shallow surface layer, 
0.5-5 nm deep, causing neutral atoms and, to a much 
smaller extent, secondary ions to be ejected (sputtered) 
from the specimen surface. Thus, a metallic oxide 
(MO) sample may act as a source of M, O, M + , 0 + , 
M“, O - , MO + and MO - species. The secondary ions, 
which are thus either monatomic or clustered, positive 
or negative, are directed into a mass spectrometer 
(analyser), wherein they are sorted and identified 
according to their mass/charge ratio. Exceptionally 
high elemental sensitivities, expressed in parts per 
million and even parts per billion, are achievable. All 
elements in the Periodic Table can be analysed and it 
is possible to distinguish between individual isotopes. 
Studies of the self-diffusion of oxygen and nitrogen 
have been hindered because these light elements have 
no isotopes that can be used as radioactive tracers. 
SIMS based on the stable isotope 18 O provides a rapid 
method for determining self-diffusion coefficients. The 
physical process whereby ions are ejected is difficult 
to express in rigorous theoretical terms, consequently 
SIMS is usually semiquantitative, with dependence 
upon calibration with standard samples of known 
composition. SIMS is a valuable complement to other 
methods of surface analysis. 

The available range of beam diameter is 1 pm to 
several millimetres. Although various types of ion 
beam are available (e.g. Ar~, 32 0 2 + , 16 0“, Cs + , etc.) 
positively-charged beams are a common choice. How- 
ever, if the sample is insulating, positive charge tends 
to accumulate in the bombarded region, changing the 
effective value of the beam voltage and degrading the 
quality of signals. One partial remedy, applicable at 
low beam voltages, is to ‘flood’ the ion-bombarded area 
with a high-intensity electron beam. In some variants 
of SIMS laser beams are used instead of ion beams. 

Of the large and growing variety of methods 
covered by the term SIMS, the dynamic, static 
and imaging modes are especially useful. Materials 
being investigated include metals, ceramics, polymers, 
catalysts, semiconductors and composites. Dynamic 
SIMS, which uses a relatively high beam current, is an 
important method for determining the distribution and 
very low concentration of dopants in semiconductors. 
The beam scans a raster, 100-500 pm in size, and 
slowly erodes the surface of the sample. Secondary 
ions from the central region of the crater are analysed 
to produce a precise depth profile of concentration. 
Static SIMS uses a much smaller beam current and the 
final spectra tend to be more informative, providing 
chemical data on the top few atomic layers of the 
sample surface. Little surface damage occurs and the 
method has been applied to polymers. The imaging 
version of SIMS has a resolution comparable to SEM 


and provide ‘maps’ that show the lateral distribution 
of elements at grain boundaries and precipitated 
particles and hydrogen segregation in alloys. Imaging 
SIMS has been applied to transverse sections through 
the complex scale layers which form when alloys 
are exposed to hot oxidizing gases (e.g. 0 2 , C0 2 ). 
Its sensitivity is greater than that obtainable with 
conventional EDX in SEM analysis and has provided 
a better understanding of growth mechanisms and the 
special role of trace elements such as yttrium. 


5.7 Thermal analysis 

5.7.1 General capabilities of thermal analysis 

Heating a material at a steady rate can produce chemi- 
cal changes, such as oxidation and degradation, and/or 
physical changes, such as the glass transition in poly- 
mers, conversions/inversions in ceramics and phase 
changes in metals. Thermal analysis is used to comple- 
ment X-ray diffraction analysis, optical and electron 
microscopy during the development of new materi- 
als and in production control. Sometimes it is used 
to define the temperature and energy change associated 
with a structural change; at other times it is used quali- 
tatively to provide a characteristic ‘fingerprint’ trace of 
a particular material. The various techniques of thermal 
analysis measure one or more physical properties of a 
sample as a function of temperature. Figure 5.46 illus- 
trates three basic methods of thermal analysis, namely 
thermogravimetric analysis (TGA), differential thermal 
analysis (DTA) and differential scanning calorimetry 
(DSC). Respectively, they measure change in mass 
(TGA) and energy flow (DTA, DSC). They can apply 
programmed heating and cooling, but usually operate 
with a slowly rising temperature. The sample chamber 
may contain air, oxygen, nitrogen, argon, etc. or be 
evacuated. A sample of a few tens of milligrams will 
often suffice. 

Recently-developed methods have extended the 
range of thermal analysis and other aspects of 
behaviour can now be studied. For instance, 
using dynamic mechanical thermal analysis (DMTA), 
mechanical as well as structural information can be 
obtained on the viscoelastic response of a polymeric 
sample to tensile, bend or shear stresses during heating. 

5.7.2 Thermogravimetric analysis 

In a thermobalance the mass of a sample is contin- 
uously determined and recorded while the sample is 
being slowly heated (Figure 5.46a). Temperatures up 
to at least 1000°C are available. It has been applied 
to the decomposition of rubbers (Figure 5.47a), kinetic 
studies of metallic oxidation, glass transitions and soft- 
ening in polymers. Equilibrium is not attained within 
the sample and the method is insensitive to the more 
subtle solid-state changes. When changes overlap, it 
can be helpful to plot the first derivative, 8m/ 8t, of 
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Figure 5.46 Basic methods of thermal analysis, (a) Thermo gravimetric analysis (TGA). (b) differential thermal analysis (DTA) 
and (c) differential scanning calorimetry (DSC). 


the graphical trace in a procedure known as derivative 
thermogravimetric analysis (DTGA). 

5.7.3 Differential thermal analysis 

DTA 1 reveals changes during the heating of a sample 
which involve evolution or absorption of energy. As 
shown diagrammatically in Figure 5.46b, a sample S 
and a chemically and thermally inert reference material 
R (sintered alumina or precipitated silica) are mounted 
in a recessed heating block and slowly heated. The 
thermocouples in S and R are connected in opposi- 
tion; their temperature difference AT is amplified and 
plotted against temperature. Peak area on this trace is a 
function of the change in enthalpy (AH) as well as the 
mass and thermal characteristics of the sample S. Small 
samples can be used to give sharper, narrower peaks, 
provided that they are fully representative of the source 

Usually accredited to H. Le Chatelier (1887): improved 
version and forerunner of modern DTA used by 
W. C. Roberts-Austen (1899) in metallurgical studies of 
alloys. 


material. Ideally, the specific heat capacities of S and 
R should be similar. DTA is generally regarded as a 
semi-quantitative or qualitative method. It has been 
used in studies of devitrification in oxide glasses and 
the glass transition in polymers. Figure 5.47b shows 
a comparison of the thermal response of high-alumina 
cement (HAC) and Portland cement. The amount of an 
undesirable weakening phase can be derived from the 
relative lengths of the ordinates X and Y in the HAC 
trace. 

5.7.4 Differential scanning calorimetry 

In this method, unlike DTA, the sample and reference 
body have separate resistive heaters (Figure 5.46c). 
When a difference in temperature develops between 
sample S and reference R, an automatic control loop 
heats the cooler of the two until the difference is 
eliminated. The electrical power needed to accomplish 
this equalizer is plotted against temperature. An 
endothermic change signifies that an enthalpy increase 
has occurred in S; accordingly, its peak is plotted 
upwards (unlike DTA traces). Differences in thermal 
conductivity and specific heat capacity have no effect 
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Figure 5.47 Examples of thermal analysis (a) TGA curve for decomposition of rubber, showing decomposition of oil and 
polymer in N 2 up to 600° C and oxidation of carbon black in air above 600° C (Hill and Nicholas, 1989), (b) DTA curve for 
high-alumina cement and Portland cement ( Hill and Nicholas, 1989) and (c) DTA curve for a quenched glassy polymer (Hay, 
1982). 


and peak areas can be expressed as energy per unit 
mass. DSC has proved particularly valuable in polymer 
research, often being used in combination with other 
techniques, such as evolved gas analysis (EGA). DSC 
has been used in studies of the curing characteristics 
of rubbers and thermoset resins, transitions in 
liquid crystals and isothermal crystallization rates 
in thermoplastics. Figure 5.47c is a trace obtained 
for a quenched amorphous polymer. DSC has also 
been used in studies of the exothermic behaviour of 
cold- worked metals as they release ‘stored energy’ 
during annealing, energy absorption during eutectic 
melting of alloys, precipitation in aluminium-based 
alloys, relaxation transformations in metallic glasses 
and drying/firing transitions in clay minerals. 
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Chapter 6 

The physical properties of materials 


6.1 Introduction 

The ways in which any material interacts and responds 
to various forms of energy are of prime interest to sci- 
entists and, in the context of engineering, provide the 
essential base for design and innovation. The energy 
acting on a material may derive from force fields (grav- 
itational, electric, magnetic), electromagnetic radiation 
(heat, light, X-rays), high-energy particles, etc. The 
responses of a material, generally referred to as its 
physical properties, are governed by the structural 
arrangement of atoms/ions/molecules in the material. 
The theme of the structure/property relation which 
has run through previous chapters is developed fur- 
ther. Special attention will be given to the diffusion of 
atoms/ions within materials because of the importance 
of thermal behaviour during manufacture and service. 
In this brief examination, which will range from den- 
sity to superconductivity, the most important physical 
properties of materials are considered. 


6.2 Density 

This property, defined as the mass per unit volume of 
a material, increases regularly with increasing atomic 
numbers in each sub-group. The reciprocal of the 
density is the specific volume v, while the product of 
v and the relative atomic mass W is known as the 
atomic volume Q. The density may be determined by 
the usual ‘immersion’ method, but it is instructive to 
show how X-rays can be used. For example, a powder 
photograph may give the lattice parameter of an fee 
metal, say copper, as 0.36 nm. Then 1/(3. 6 x 10 -10 ) 3 
or 2.14 x 10 28 cells of this size (0.36 nm) are found 
in a cube 1 m edge length. The total number of 
atoms in 1 m 3 is then 4 x 2.14 x 10 28 = 8.56 x 10 28 
since an fee cell contains four atoms. Furthermore, 
the mass of a copper atom is 63.57 times the mass 
of a hydrogen atom (which is 1.63 x 10~ 24 g) so 


that the mass of 1 m 3 of copper, i.e. the density, is 
8.56 x 10 28 x 63.57 x 1.63 x 1(T 24 = 8900 kg m~ 3 . 

On alloying, the density of a metal changes. This 
is because the mass of the solute atom differs from 
that of the solvent, and also because the lattice 
parameter usually changes on alloying. The parameter 
change may often be deduced from Vegard’s law, 
which assumes that the lattice parameter of a solid 
solution varies linearly with atomic concentration, but 
numerous deviations from this ideal behaviour do 
exist. 

The density clearly depends on the mass of the 
atoms, their size and the way they are packed. Metals 
are dense because they have heavy atoms and close 
packing; ceramics have lower densities than metals 
because they contain light atoms, either C, N or O; 
polymers have low densities because they consist of 
light atoms in chains. Figure 6.1 shows the spread in 
density values for the different material classes. Such 
‘Material Property Charts’, as developed by Ashby, 
are useful when selecting materials during engineer- 
ing design. 

6.3 Thermal properties 

6.3.1 Thermal expansion 

If we consider a crystal at absolute zero temperature, 
the ions sit in a potential well of depth E ro below the 
energy of a free atom (Figure 6.2). The effect of raising 
the temperature of the crystal is to cause the ions 
to oscillate in this asymmetrical potential well about 
their mean positions. As a consequence, this motion 
causes the energy of the system to rise, increasing 
with increasing amplitude of vibration. The increasing 
amplitude of vibration also causes an expansion of the 
crystal, since as a result of the sharp rise in energy 
below r 0 the ions as they vibrate to and fro do not 
approach much closer than the equilibrium separation, 
r 0 , but separate more widely when moving apart. When 
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Figure 6.1 Strength o, plotted against density, p (yield strength for metals and polymers, compressive strength for ceramics, 
tear strength for elastomers and tensile strength for composites). The guide lines of constant o/ p, a 2 ! 3 / p and o 1 / 2 / p are used 
in minimum weight, yield-limited, design (Ashby, 1989, pp. 1273-93, with permission of Elsevier Science Ltd.). 



Figure 6.2 Variation in potential energy with interatomic 
distance. 

the distance r is such that the atoms are no longer 
interacting, the material is transformed to the gaseous 
phase, and the energy to bring this about is the energy 
of evaporation. 


The change in dimensions with temperature is 
usually expressed in terms of the linear coefficient 
of expansion a, given by a = (l//)(d//dr), where 
/ is the original length of the specimen and T is 
the absolute temperature. Because of the anisotropic 
nature of crystals, the value of a usually varies with 
the direction of measurement and even in a particular 
crystallographic direction the dimensional change with 
temperature may not always be uniform. 

Phase changes in the solid state are usually studied 
by dilatometry. The change in dimensions of a 
specimen can be transmitted to a sensitive dial gauge 
or electrical transducer by means of a fused silica 
rod. When a phase transformation takes place, because 
the new phase usually occupies a different volume 
to the old phase, discontinuities are observed in the 
coefficient of thermal expansion a versus T curve. 
Some of the ‘nuclear metals’ which exist in many 
allotropic forms, such as uranium and plutonium, show 
a negative coefficient along one of the crystallographic 
axes in certain of their allotropic modifications. 
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The change in volume with temperature is important 
in many metallurgical operations such as casting, 
welding and heat treatment. Of particular importance 
is the volume change associated with the melting 
or, alternatively, the freezing phenomenon since this 
is responsible for many of the defects, both of 
a macroscopic and microscopic size, which exist 
in crystals. Most metals increase their volume by 
about 3% on melting, although those metals which 
have crystal structures of lower coordination, such as 
bismuth, antimony or gallium, contract on melting. 
This volume change is quite small, and while the 
liquid structure is more open than the solid structure, 
it is clear that the liquid state resembles the solid 
state more closely than it does the gaseous phase. For 
the simple metals the latent heat of melting, which 
is merely the work done in separating the atoms from 
the close-packed structure of the solid to the more open 
liquid structure, is only about one thirtieth of the latent 
heat of evaporation, while the electrical and thermal 
conductivities are reduced only to three-quarters to 
one-half of the solid state values. 

6.3.2 Specific heat capacity 

The specific heat is another thermal property important 
in the processing operations of casting or heat 
treatment, since it determines the amount of heat 
required in the process. Thus, the specific heat (denoted 
by C p , when dealing with the specific heat at constant 
pressure) controls the increase in temperature, d T, 
produced by the addition of a given quantity of heat, 
d 2, to one gram of matter so that dQ = C p dT. 

The specific heat of a metal is due almost entirely 
to the vibrational motion of the ions. However, a 
small part of the specific heat is due to the motion 
of the free electrons, which becomes important at 
high temperatures, especially in transition metals with 
electrons in incomplete shells. 

The classical theory of specific heat assumes that 
an atom can oscillate in any one of three directions, 
and hence a crystal of N atoms can vibrate in 3N 
independent normal modes, each with its characteristic 
frequency. Furthermore, the mean energy of each nor- 
mal mode will be k T, so that the total vibrational 
thermal energy of the metal is E = 3NkT. In solid 
and liquid metals, the volume changes on heating are 
very small and, consequently, it is customary to con- 
sider the specific heat at constant volume. If N , the 
number of atoms in the crystal, is equal to the number 
of atoms in a gram-atom (i.e. Avogadro number), the 
heat capacity per gram-atom, i.e. the atomic heat, at 
constant volume is given by 

C v ( — ) = — = 3Nk = 24.95 J K” 1 
U T) v d T 

In practice, of course, when the specific heat is exper- 
imentally determined, it is the specific heat at constant 
pressure, C p , which is measured, not C v , and this is 


given by 

6E + PdV\ _d H 
dr ) p ~d T 

where H = E + PV is known as the heat content or 
enthalpy, C p is greater than C v by a few per cent 
because some work is done against interatomic forces 
when the crystal expands, and it can be shown that 

C p -C v = 9a 2 VT / ft 

where a is the coefficient of linear thermal expansion, 
V is the volume per gram- atom and ft is the 
compressibility. 

Dulong and Petit were the first to point out that the 
specific heat of most materials, when determined at 
sufficiently high temperatures and corrected to apply 
to constant volume, is approximately equal to 3R, 
where R is the gas constant. However, deviations 
from the ‘classical’ value of the atomic heat occur 
at low temperatures, as shown in Figure 6.3a. This 
deviation is readily accounted for by the quantum 
theory, since the vibrational energy must then be 
quantized in multiples of hv, where h is Planck’s 
constant and v is the characteristic frequency of the 
normal mode of vibration. 

According to the quantum theory, the mean energy 
of a normal mode of the crystal is 

E(y) = |hv + {hv/ exp (hv/k T) — 1} 

where ^hv represents the energy a vibrator will have 
at the absolute zero of temperature, i.e. the zero-point 
energy. Using the assumption made by Einstein (1907) 
that all vibrations have the same frequency (i.e. all 
atoms vibrate independently), the heat capacity is 

C v = (dE/dT) v 

= 3Nk(hv/kr) 2 

[exp (hv/k7’)/{exp (hv/k7’)— l} 2 ] 

This equation is rarely written in such a form because 
most materials have different values of v. It is more 
usual to express v as an equivalent temperature defined 
by 0 E = hv/k, where 0 E is known as the Einstein 
characteristic temperature. Consequently, when C v is 
plotted against r/0 E , the specific heat curves of all 
pure metals coincide and the value approaches zero at 
very low temperatures and rises to the classical value 
of 3Nk = 3 R~ 25.2 J/g at high temperatures. 

Einstein’s formula for the specific heat is in good 
agreement with experiment for T>0 E , but is poor for 
low temperatures where the practical curve falls off 
less rapidly than that given by the Einstein relationship. 
However, the discrepancy can be accounted for, as 
shown by Debye, by taking account of the fact that the 
atomic vibrations are not independent of each other. 
This modification to the theory gives rise to a Debye 
characteristic temperature 0 D , which is defined by 

k0 D = hv D 
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where v D is Debye’s maximum frequency. Figure 6.3b 
shows the atomic heat curves of Figure 6.3a plotted 
against T/0 D ; in most metals for low temperatures 
(r/0 D « 1) a f 3 law is obeyed, but at high temper- 
atures the free electrons make a contribution to the 
atomic heat which is proportional to T and this causes 
a rise of C above the classical value. 

6.3.3 The specific heat curve and 
transformations 

The specific heat of a metal varies smoothly with tem- 
perature, as shown in Figure 6.3a, provided that no 
phase change occurs. On the other hand, if the metal 
undergoes a structural transformation the specific heat 
curve exhibits a discontinuity, as shown in Figure 6.4. 
If the phase change occurs at a fixed temperature, the 
metal undergoes what is known as a first-order trans- 
formation; for example, the a to y, y to 8 and 8 to liq- 
uid phase changes in iron shown in Figure 6.4a. At the 
transformation temperature the latent heat is absorbed 
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Figure 6.3 The variation of atomic heat with temperature. 
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without a rise in temperature, so that the specific heat 
(dQ/dT) at the transformation temperature is infinite. 
In some cases, known as transformations of the sec- 
ond order, the phase transition occurs over a range 
of temperature (e.g. the order- disorder transformation 
in alloys), and is associated with a specific heat peak 
of the form shown in Figure 6.4b. Obviously the nar- 
rower the temperature range T\ — T c , the sharper is 
the specific heat peak, and in the limit when the total 
change occurs at a single temperature, i.e. T\ = T C , the 
specific heat becomes infinite and equal to the latent 
heat of transformation. A second-order transformation 
also occurs in iron (see Figure 6.4a), and in this case 
is due to a change in ferromagnetic properties with 
temperature. 

6.3.4 Free energy of transformation 

In Section 3. 2.3. 2 it was shown that any structural 
changes of a phase could be accounted for in terms 
of the variation of free energy with temperature. The 
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Figure 6.4 The effect of solid state transformations on the specific heat-temperature curve. 
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relative magnitude of the free energy value governs the 
stability of any phase, and from Figure 3.9a it can be 
seen that the free energy G at any temperature is in turn 
governed by two factors: (1) the value of G at 0 K, 
G 0 , and (2) the slope of the G versus T curve, i.e. the 
temperature-dependence of free energy. Both of these 
terms are influenced by the vibrational frequency, and 
consequently the specific heat of the atoms, as can be 
shown mathematically. For example, if the temperature 
of the system is raised from T to T + dT the change 
in free energy of the system dG is 

dG = dH — TdS - SdT 

= C p dT - T(C p dT/T) - SdT 
= -SdT 


so that the free energy of the system at a temperature 
T is 

G = G 0 — f SdT 
Jo 

At the absolute zero of temperature, the free energy 
G 0 is equal to H 0 , and then 

G = H 0 - f SdT 
Jo 


which if S is replaced by / Q r (G p /T)dT becomes 


G = H o 


• r [ f\c p /T)dT 
Jo IJo 


dT 


( 6 . 1 ) 


Equation (6.1) indicates that the free energy of a given 
phase decreases more rapidly with rise in tempera- 
ture the larger its specific heat. The intersection of the 
free energy -temperature curves, shown in Figure 3.9a, 
therefore takes place because the low-temperature 
phase has a smaller specific heat than the higher- 
temperature phase. 

At low temperatures the second term in equation 
(6.1) is relatively unimportant, and the phase that 
is stable is the one which has the lowest value 
of Hq, i.e. the most close-packed phase which is 
associated with a strong bonding of the atoms. 
However, the more strongly bound the phase, the 
higher is its elastic constant, the higher the vibrational 
frequency, and consequently the smaller the specific 
heat (see Figure 6.3a). Thus, the more weakly bound 
structure, i.e. the phase with the higher H 0 at low 
temperature, is likely to appear as the stable phase 
at higher temperatures. This is because the second 
term in equation (6.1) now becomes important and G 
decreases more rapidly with increasing temperature, 
for the phase with the largest value of f (C p /T)dT. 
From Figure 6.3b it is clear that a large f (C p /T)dT 
is associated with a low characteristic temperature 
and hence, with a low vibrational frequency such as 
is displayed by a metal with a more open structure 
and small elastic strength. In general, therefore, when 


phase changes occur the more close-packed structure 
usually exists at the low temperatures and the more 
open structures at the high temperatures. From this 
viewpoint a liquid, which possesses no long-range 
structure, has a higher entropy than any solid phase 
so that ultimately all metals must melt at a sufficiently 
high temperature, i.e. when the TS term outweighs the 
H term in the free energy equation. 

The sequence of phase changes in such metals as 
titanium, zirconium, etc. is in agreement with this pre- 
diction and, moreover, the alkali metals, lithium and 
sodium, which are normally bcc at ordinary temper- 
atures, can be transformed to fee at sub-zero temper- 
atures. It is interesting to note that iron, being bcc 
(a-iron) even at low temperatures and fee (/-iron) at 
high temperatures, is an exception to this rule. In this 
case, the stability of the bcc structure is thought to be 
associated with its ferromagnetic properties. By hav- 
ing a bcc structure the interatomic distances are of the 
correct value for the exchange interaction to allow the 
electrons to adopt parallel spins (this is a condition for 
magnetism). While this state is one of low entropy it is 
also one of minimum internal energy, and in the lower 
temperature ranges this is the factor which governs the 
phase stability, so that the bcc structure is preferred. 

Iron is also of interest because the bcc structure, 
which is replaced by the fee structure at temperatures 
above 910°C, reappears as the 5-phase above 1400°C. 
This behaviour is attributed to the large electronic spe- 
cific heat of iron which is a characteristic feature of 
most transition metals. Thus, the Debye characteristic 
temperature of /-iron is lower than that of o'-iron and 
this is mainly responsible for the a to / transformation. 
However, the electronic specific heat of the o'-phase 
becomes greater than that of the /-phase above about 
300° C and eventually at higher temperatures becomes 
sufficient to bring about the return to the bcc structure 
at 1400°C. 

6.4 Diffusion 

6.4.1 Diffusion laws 

Some knowledge of diffusion is essential in 
understanding the behaviour of materials, particularly 
at elevated temperatures. A few examples include 
such commercially important processes as annealing, 
heat-treatment, the age-hardening of alloys, sintering, 
surface-hardening, oxidation and creep. Apart from 
the specialized diffusion processes, such as grain 
boundary diffusion and diffusion down dislocation 
channels, a distinction is frequently drawn between 
diffusion in pure metals, homogeneous alloys and 
inhomogeneous alloys. In a pure material self-diffusion 
can be observed by using radioactive tracer atoms. 
In a homogeneous alloy diffusion of each component 
can also be measured by a tracer method, but in an 
inhomogeneous alloy, diffusion can be determined by 
chemical analysis merely from the broadening of the 
interface between the two metals as a function of time. 
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Figure 6.5 Effect of diffusion on the distribution of solute in 
an alloy. 

Inhomogeneous alloys are common in metallurgical 
practice (e.g. cored solid solutions) and in such 
cases diffusion always occurs in such a way as to 
produce a macroscopic flow of solute atoms down the 
concentration gradient. Thus, if a bar of an alloy, along 
which there is a concentration gradient (Figure 6.5) is 
heated for a few hours at a temperature where atomic 
migration is fast, i.e. near the melting point, the solute 
atoms are redistributed until the bar becomes uniform 
in composition. This occurs even though the individual 
atomic movements are random, simply because there 
are more solute atoms to move down the concentration 
gradient than there are to move up. This fact forms the 
basis of Fick’ s law of diffusion, which is 


n ^ atoms 
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Figure 6.6 Diffusion of atoms down a concentration 
gradient. 


with J x the flux of diffusing atoms. Setting c\ — c 2 = 
— b(dc/dx) this flux becomes 

J x = — p x v v b 2 (dc/dx) = — \vb 2 (dc/dx) 


dn/dt = — Ddc/dx 


(6.2) = ~D(dc/dx) 


(6.3) 


Here the number of atoms diffusing in unit time 
across unit area through a unit concentration gradient 
is known as the diffusivity or diffusion coefficient, 1 D. 
It is usually expressed as units of cm 2 s -1 or m 2 s _1 and 
depends on the concentration and temperature of the 
alloy. 

To illustrate, we may consider the flow of atoms 
in one direction x, by taking two atomic planes A 
and B of unit area separated by a distance b, as 
shown in Figure 6.6. If c\ and c 2 are the concentrations 
of diffusing atoms in these two planes (c\ > c 2 ) the 
corresponding number of such atoms in the respective 
planes is ri\ = c\b and n 2 = c 2 b. If the probability 
that any one jump in the +x direction is p x , then 
the number of jumps per unit time made by one atom 
is p x v, where v is the mean frequency with which 
an atom leaves a site irrespective of directions. The 
number of diffusing atoms leaving A and arriving at 
B in unit time is ( p x vc\b ) and the number making the 
reverse transition is ( p x vc 2 b ) so that the net gain of 
atoms at B is 


In cubic lattices, diffusion is isotropic and hence all six 
orthogonal directions are equally likely so that p x = | . 
For simple cubic structures b = a and thus 

D x = D y = D z = ^va 2 = D (6.4) 


whereas in fee structures b = a/\fl and D = j^va 2 , 
and in bcc structures D = y^va 2 . 

Fick’s first law only applies if a steady state exists 
in which the concentration at every point is invariant, 
i.e. ( dc/dt ) = 0 for all x. To deal with nonstationary 
flow in which the concentration at a point changes 
with time, we take two planes A and B, as before, 
separated by unit distance and consider the rate of 
increase of the number of atoms (dc/dt) in a unit 
volume of the specimen; this is equal to the difference 
between the flux into and that out of the volume 
element. The flux across one plane is J x and across the 
other (J x + \)dJ /dx the difference being — (dJ/dx). 
We thus obtain Fick’s second law of diffusion 


dc 

dJ x 

d 

/ dc 

— — 

— 

— — 

Dy 

dt 

dx 

dx 

V dx 


(6.5) 


p x vb(c x - c 2 ) = J x 


1 The conduction of heat in a still medium also follows the 
same laws as diffusion. 


When D is independent of concentration this reduces 
to 


dc x d 2 c 

£ — n 

d t X dx 2 


( 6 . 6 ) 
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and in three dimensions becomes 


dc d / dc\ d / dc\ d 
d t dx \ x dx) + dy \ y dy ) + dz 



An illustration of the use of the diffusion equations 
is the behaviour of a diffusion couple, where there 
is a sharp interface between pure metal and an alloy. 
Figure 6.5 can be used for this example and as the 
solute moves from alloy to the pure metal the way in 
which the concentration varies is shown by the dotted 
lines. The solution to Fick’s second law is given by 


c = 


co 

2 


2 rx/VVm] 

1 j= / exp(— y 2 )dy 

V 71 7 o 


(6.7) 


where c 0 is the initial solute concentration in the alloy 
and c is the concentration at a time r at a distance 
v from the interface. The integral term is known as 
the Gauss error function (erf (y)) and as y -> oo, 
erf (y) 1. It will be noted that at the interface where 

v = 0, then c = c 0 /2, and in those regions where the 
curvature d 2 c/dx 2 is positive the concentration rises, 
in those regions where the curvature is negative the 
concentration falls, and where the curvature is zero 
the concentration remains constant. 

This particular example is important because it can 
be used to model the depth of diffusion after time 
t , e.g. in the case-hardening of steel, providing the 
concentration profile of the carbon after a carburizing 
time t , or dopant in silicon. Starting with a constant 
composition at the surface, the value of x where 
the concentration falls to half the initial value, i.e. 
1 — erf(y) = is given by x = V Jpi )• Thus knowing 
D at a given temperature the time to produce a given 
depth of diffusion can be estimated. 

The diffusion equations developed above can also be 
transformed to apply to particular diffusion geometries. 
If the concentration gradient has spherical symmetry 
about a point, c varies with the radial distance r and, 
for constant D , 


dc _ / d 2 c 2 dc\ 
d t V dr 2 r dr ) 


( 6 . 8 ) 


When the diffusion field has radial symmetry about a 
cylindrical axis, the equation becomes 


dc / d 2 c 1 dc\ 

dt \ dr 2 r dr ) 


(6.9) 


and the steady- state condition ( dc/dt ) = 0 is given by 


d 2 c 1 dc 
dr 2 r dr 


( 6 . 10 ) 


which has a solution c = Alnr + B. The constants A 
and B may be found by introducing the appropriate 
boundary conditions and for c = c 0 at r = r 0 and 
c = ci at r = r\ the solution becomes 

c 0 ln(ri/r) + ci ln(r/r 0 ) 

c = 

ln(ri/r 0 ) 


The flux through any shell of radius r is — 2nrD(dc/dr) 
or 


J 


2 ttD 
ln(r i/r 0 ) 


(ci - c 0 ) 


( 6 . 11 ) 


Diffusion equations are of importance in many diverse 
problems and in Chapter 4 are applied to the diffusion 
of vacancies from dislocation loops and the sintering 
of voids. 


6.4.2 Mechanisms of diffusion 

The transport of atoms through the lattice may conceiv- 
ably occur in many ways. The term ‘interstitial diffu- 
sion’ describes the situation when the moving atom 
does not lie on the crystal lattice, but instead occu- 
pies an interstitial position. Such a process is likely 
in interstitial alloys where the migrating atom is very 
small (e.g. carbon, nitrogen or hydrogen in iron). In 
this case, the diffusion process for the atoms to move 
from one interstitial position to the next in a perfect 
lattice is not defect-controlled. A possible variant of 
this type of diffusion has been suggested for substitu- 
tional solutions in which the diffusing atoms are only 
temporarily interstitial and are in dynamic equilibrium 
with others in substitutional positions. However, the 
energy to form such an interstitial is many times that to 
produce a vacancy and, consequently, the most likely 
mechanism is that of the continual migration of vacan- 
cies. With vacancy diffusion, the probability that an 
atom may jump to the next site will depend on: (1) the 
probability that the site is vacant (which in turn is pro- 
portional to the fraction of vacancies in the crystal), 
and (2) the probability that it has the required activa- 
tion energy to make the transition. For self-diffusion 
where no complications exist, the diffusion coefficient 
is therefore given by 

D=yfvexp[(S t + S m )/k] 

x exp [— Ef/kT] exp [— E m /kT] 

= D 0 exp [-(£ f + E m )/kT] (6.12) 

The factor / appearing in D 0 is known as a correla- 
tion factor and arises from the fact that any particular 
diffusion jump is influenced by the direction of the 
previous jump. Thus when an atom and a vacancy 
exchange places in the lattice there is a greater prob- 
ability of the atom returning to its original site than 
moving to another site, because of the presence there 
of a vacancy; / is 0.80 and 0.78 for fee and bcc 
lattices, respectively. Values for E f and E m are dis- 
cussed in Chapter 4, Ef is the energy of formation of 
a vacancy, E m the energy of migration, and the sum 
of the two energies, Q = Ef + E m , is the activation 
energy for self-diffusion 1 E d . 

! The entropy factor exp [(Sf + S m )/k] is usually taken to be 
unity. 
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In alloys, the problem is not so simple and it is 
found that the self-diffusion energy is smaller than in 
pure metals. This observation has led to the sugges- 
tion that in alloys the vacancies associate preferentially 
with solute atoms in solution; the binding of vacancies 
to the impurity atoms increases the effective vacancy 
concentration near those atoms so that the mean jump 
rate of the solute atoms is much increased. This asso- 
ciation helps the solute atom on its way through the 
lattice, but, conversely, the speed of vacancy migration 
is reduced because it lingers in the neighbourhood of 
the solute atoms, as shown in Figure 6.7. The phe- 
nomenon of association is of fundamental importance 
in all kinetic studies since the mobility of a vacancy 
through the lattice to a vacancy sink will be governed 
by its ability to escape from the impurity atoms which 
trap it. This problem has been mentioned in Chapter 4. 

When considering diffusion in alloys it is impor- 
tant to realize that in a binary solution of A and B 
the diffusion coefficients D A and D B are generally not 
equal. This inequality of diffusion was first demon- 
strated by Kirkendall using an a-brass/copper couple 
(Figure 6.8). He noted that if the position of the inter- 
faces of the couple were marked (e.g. with fine W or 
Mo wires), during diffusion the markers move towards 
each other, showing that the zinc atoms diffuse out of 
the alloy more rapidly than copper atoms diffuse in. 
This being the case, it is not surprising that several 
workers have shown that porosity develops in such 
systems on that side of the interface from which there 
is a net loss of atoms. 

The Kirkendall effect is of considerable theoretical 
importance since it confirms the vacancy mechanism 
of diffusion. This is because the observations cannot 
easily be accounted for by any other postulated 
mechanisms of diffusion, such as direct place- 
exchange, i.e. where neighbouring atoms merely 
change place with each other. The Kirkendall effect 
is readily explained in terms of vacancies since the 
lattice defect may interchange places more frequently 
with one atom than the other. The effect is also of 
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Figure 6.7 Solute atom-vacancy association during 
diffusion. 
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Figure 6.8 a-brass-copper couple for demonstrating the 
Kirkendall effect. 


some practical importance, especially in the fields of 
metal-to-metal bonding, sintering and creep. 

6.4.3 Factors affecting diffusion 

The two most important factors affecting the diffu- 
sion coefficient D are temperature and composition. 
Because of the activation energy term the rate of diffu- 
sion increases with temperature according to equation 
(6.12), while each of the quantities D , D 0 and Q 
varies with concentration; for a metal at high temper- 
atures Q « 20 RT m , D 0 is 10 -5 to 10 -3 m 2 s -1 , and 
D — 10 -12 m 2 s -1 . Because of this variation of diffu- 
sion coefficient with concentration, the most reliable 
investigations into the effect of other variables neces- 
sarily concern self-diffusion in pure metals. 

Diffusion is a structure- sensitive property and, 
therefore, D is expected to increase with increasing 
lattice irregularity. In general, this is found experi- 
mentally. In metals quenched from a high temper- 
ature the excess vacancy concentration ~10 9 leads 
to enhanced diffusion at low temperatures since D = 
D 0 c v exp (— E m /kT ). Grain boundaries and disloca- 
tions are particularly important in this respect and 
produce enhanced diffusion. Diffusion is faster in the 
cold-worked state than in the annealed state, although 
recrystallization may take place and tend to mask the 
effect. The enhanced transport of material along dislo- 
cation channels has been demonstrated in aluminium 
where voids connected to a free surface by dislo- 
cations anneal out at appreciably higher rates than 
isolated voids. Measurements show that surface and 
grain boundary forms of diffusion also obey Arrhe- 
nius equations, with lower activation energies than 
for volume diffusion, i.e. Q vo \ > 2 Q g _ b > 20 surface . This 
behaviour is understandable in view of the progres- 
sively more open atomic structure found at grain 
boundaries and external surfaces. It will be remem- 
bered, however, that the relative importance of the 
various forms of diffusion does not entirely depend on 
the relative activation energy or diffusion coefficient 
values. The amount of material transported by any dif- 
fusion process is given by Fick’s law and for a given 
composition gradient also depends on the effective area 
through which the atoms diffuse. Consequently, since 
the surface area (or grain boundary area) to volume 
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ratio of any polycrystalline solid is usually very small, 
it is only in particular phenomena (e.g. sintering, oxi- 
dation, etc.) that grain boundaries and surfaces become 
important. It is also apparent that grain boundary diffu- 
sion becomes more competitive, the finer the grain and 
the lower the temperature. The lattice feature follows 
from the lower activation energy which makes it less 
sensitive to temperature change. As the temperature 
is lowered, the diffusion rate along grain boundaries 
(and also surfaces) decreases less rapidly than the dif- 
fusion rate through the lattice. The importance of grain 
boundary diffusion and dislocation pipe diffusion is 
discussed again in Chapter 7 in relation to deformation 
at elevated temperatures, and is demonstrated con- 
vincingly on the deformation maps (see Figure 7.68), 
where the creep field is extended to lower temperatures 
when grain boundary (Coble creep) rather than lattice 
diffusion (Herring -Nabarro creep) operates. 

Because of the strong binding between atoms, pres- 
sure has little or no effect but it is observed that with 
extremely high pressure on soft metals (e.g. sodium) 
an increase in Q may result. The rate of diffusion 
also increases with decreasing density of atomic pack- 
ing. For example, self-diffusion is slower in fee iron 
or thallium than in bcc iron or thallium when the 
results are compared by extrapolation to the transfor- 
mation temperature. This is further emphasized by the 
anisotropic nature of D in metals of open structure. 
Bismuth (rhombohedral) is an example of a metal in 
which D varies by 10 6 for different directions in the 
lattice; in cubic crystals D is isotropic. 


6.5 Anelasticity and internal friction 

For an elastic solid it is generally assumed that stress 
and strain are directly proportional to one another, but 
in practice the elastic strain is usually dependent on 
time as well as stress so that the strain lags behind the 
stress; this is an anelastic effect. On applying a stress at 
a level below the conventional elastic limit, a specimen 
will show an initial elastic strain £ e followed by a 
gradual increase in strain until it reaches an essentially 
constant value, s Q + s an as shown in Figure 6.9. When 
the stress is removed the strain will decrease, but a 
small amount remains which decreases slowly with 
time. At any time t the decreasing anelastic strain is 
given by the relation s = s an exp {—t/ r) where r is 
known as the relaxation time, and is the time taken 
for the anelastic strain to decrease to 1/e ~ 36.79% of 
its initial value. Clearly, if r is large, the strain relaxes 
very slowly, while if small the strain relaxes quickly. 

In materials under cyclic loading this anelastic effect 
leads to a decay in amplitude of vibration and therefore 
a dissipation of energy by internal friction. Internal 
friction is defined in several different but related ways. 
Perhaps the most common uses the logarithmic decre- 
ment 8 = ln(A„/A n+ i), the natural logarithm of suc- 
cessive amplitudes of vibration. In a forced vibration 
experiment near a resonance, the factor (co 2 — (D\)/cdo 



Figure 6.9 Anelastic behaviour. 


is often used, where a>\ and co 2 are the frequencies on 
the two sides of the resonant frequency co 0 at which 
the amplitude of oscillation is 1/V2 of the resonant 
amplitude. Also used is the specific damping capacity 
A E/E, where A E is the energy dissipated per cycle 
of vibrational energy E , i.e. the area contained in a 
stress -strain loop. Yet another method uses the phase 
angle a by which the strain lags behind the stress, and 
if the damping is small it can be shown that 


tan a = 


8 
7 1 


1 A E 
2tt1T 


C0 2 - CD\ 
(D 0 


= Q~ l 


(6.13) 


By analogy with damping in electrical systems tan a 
is often written equal to Q~ l . 

There are many causes of internal friction arising 
from the fact that the migration of atoms, lattice 
defects and thermal energy are all time-dependent 
processes. The latter gives rise to thermoelasticity and 
occurs when an elastic stress is applied to a specimen 
too fast for the specimen to exchange heat with its 
surroundings and so cools slightly. As the sample 
warms back to the surrounding temperature it expands 
thermally, and hence the dilatation strain continues to 
increase after the stress has become constant. 

The diffusion of atoms can also give rise to 
anelastic effects in an analogous way to the diffusion 
of thermal energy giving thermoelastic effects. A 
particular example is the stress-induced diffusion of 
carbon or nitrogen in iron. A carbon atom occupies 
the interstitial site along one of the cell edges slightly 
distorting the lattice tetragonally. Thus when iron 
is stretched by a mechanical stress, the crystal axis 
oriented in the direction of the stress develops favoured 
sites for the occupation of the interstitial atoms 
relative to the other two axes. Then if the stress is 
oscillated, such that first one axis and then another is 
stretched, the carbon atoms will want to jump from 
one favoured site to the other. Mechanical work is 
therefore done repeatedly, dissipating the vibrational 
energy and damping out the mechanical oscillations. 
The maximum energy is dissipated when the time per 
cycle is of the same order as the time required for the 
diffusional jump of the carbon atom. 
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Figure 6.10 Schematic diagram of a Ke torsion pendulum. 


The simplest and most convenient way of studying 
this form of internal friction is by means of a Ke 
torsion pendulum, shown schematically in Figure 6.10. 
The specimen can be oscillated at a given frequency 
by adjusting the moment of inertia of the torsion bar. 
The energy loss per cycle A E/E varies smoothly with 
the frequency according to the relation 
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and has a maximum value when the angular frequency 
of the pendulum equals the relaxation time of the 
process; at low temperatures around room temperature 
this is interstitial diffusion. In practice, it is difficult to 
vary the angular frequency over a wide range and thus 
it is easier to keep co constant and vary the relaxation 
time. Since the migration of atoms depends strongly on 
temperature according to an Arrhenius-type equation, 
the relaxation time X\ — l/co\ and the peak occurs 
at a temperature T\. For a different frequency value 
C 02 the peak occurs at a different temperature T 2 , and 
so on (see Figure 6.11). It is thus possible to ascribe 
an activation energy AH for the internal process 
producing the damping by plotting In r versus 1 /T, 
or from the relation 


AH = R 


ln(co 2 /a)i) 

Wi - 1/T 2 


In the case of iron the activation energy is found to 
coincide with that for the diffusion of carbon in iron. 
Similar studies have been made for other metals. In 
addition, if the relaxation time is r the mean time 
an atom stays in an interstitial position is (|)r, and 
from the relation D = ^ a 2 v for bcc lattices derived 
previously the diffusion coefficient may be calculated 
directly from 


1 


36 



Many other forms of internal friction exist in met- 
als arising from different relaxation processes to those 
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Figure 6.11 Internal friction as a function of temperature 
for Fe with C in solid solution at five different pendulum 
frequencies (from Wert and Zener, 1949; by permission of 
the American Institute of Physics). 


discussed above, and hence occurring in different fre- 
quency and temperature regions. One important source 
of internal friction is that due to stress relaxation across 
grain boundaries. The occurrence of a strong internal 
friction peak due to grain boundary relaxation was first 
demonstrated on poly crystalline aluminium at 300°C 
by Ke and has since been found in numerous other 
metals. It indicates that grain boundaries behave in 
a somewhat viscous manner at elevated temperatures 
and grain boundary sliding can be detected at very low 
stresses by internal friction studies. The grain boundary 
sliding velocity produced by a shear stress r is given 
by v = rd/q and its measurement gives values of the 
viscosity q which extrapolate to that of the liquid at 
the melting point, assuming the boundary thickness to 
be d — 0.5 nm. 

Movement of low-energy twin boundaries in crys- 
tals, domain boundaries in ferromagnetic materials and 
dislocation bowing and unpinning all give rise to inter- 
nal friction and damping. 


6.6 Ordering in alloys 

6.6.1 Long-range and short-range order 

An ordered alloy may be regarded as being made up 
of two or more interpenetrating sub-lattices, each con- 
taining different arrangements of atoms. Moreover, the 
term ‘ superlattice ’ would imply that such a coher- 
ent atomic scheme extends over large distances, i.e. 
the crystal possesses long-range order. Such a perfect 
arrangement can exist only at low temperatures, since 
the entropy of an ordered structure is much lower than 
that of a disordered one, and with increasing tempera- 
ture the degree of long-range order, S, decreases until 
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at a critical temperature T c it becomes zero; the general 
form of the curve is shown in Figure 6.12. Partially- 
ordered structures are achieved by the formation of 
small regions (domains) of order, each of which are 
separated from each other by domain or anti-phase 
domain boundaries, across which the order changes 
phase (Figure 6.13). However, even when long-range 
order is destroyed, the tendency for unlike atoms to be 
neighbours still exists, and short-range order results 
above T c . The transition from complete disorder to 
complete order is a nucleation and growth process and 
may be likened to the annealing of a cold-worked 
structure. At high temperatures well above T c , there 
are more than the random number of AB atom pairs, 
and with the lowering of temperature small nuclei 
of order continually form and disperse in an other- 
wise disordered matrix. As the temperature, and hence 
thermal agitation, is lowered these regions of order 
become more extensive, until at T c they begin to link 
together and the alloy consists of an interlocking mesh 
of small ordered regions. Below T c these domains 
absorb each other (cf. grain growth) as a result of 
antiphase domain boundary mobility until long-range 
order is established. 

Some order- disorder alloys can be retained in a 
state of disorder by quenching to room temperature 
while in others (e.g. /3-brass) the ordering process 
occurs almost instantaneously. Clearly, changes in the 
degree of order will depend on atomic migration, so 
that the rate of approach to the equilibrium configu- 
ration will be governed by an exponential factor of 
the usual form, i.e. Rate = Ae~ Q/RT . However, Bragg 



Figure 6.12 Influence of temperature on the degree of order. 
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Figure 6.13 An antiphase domain boundary. 

has pointed out that the ease with which interlocking 
domains can absorb each other to develop a scheme 
of long-range order will also depend on the number of 
possible ordered schemes the alloy possesses. Thus, in 
/3-brass only two different schemes of order are possi- 
ble, while in fee lattices such as CU 3 AU four different 
schemes are possible and the approach to complete 
order is less rapid. 

6.6.2 Detection of ordering 

The determination of an ordered superlattice is usu- 
ally done by means of the X-ray powder technique. In 
a disordered solution every plane of atoms is statisti- 
cally identical and, as discussed in Chapter 5, there are 
reflections missing in the powder pattern of the mate- 
rial. In an ordered lattice, on the other hand, alternate 
planes become A-rich and B-rich, respectively, so that 
these ‘absent’ reflections are no longer missing but 
appear as extra superlattice lines. This can be seen 
from Figure 6.14: while the diffracted rays from the 
A planes are completely out of phase with those from 
the B planes their intensities are not identical, so that 
a weak reflection results. 

Application of the structure factor equation indicates 
that the intensity of the superlattice lines is 
proportional to \F 2 \ = 5 2 (/ A — / B ) 2 , from which 
it can be seen that in the fully-disordered alloy, 
where 5 = 0, the superlattice lines must vanish. In 
some alloys such as copper- gold, the scattering 
factor difference (/ A — / B ) is appreciable and the 
superlattice lines are, therefore, quite intense and 
easily detectable. In other alloys, however, such 
as iron-cobalt, nickel -manganese, copper-zinc, the 
term (/ A — / B ) is negligible for X-rays and the 
super-lattice lines are very weak; in copper- zinc, for 



Figure 6.14 Formation of a weak 100 reflection from an ordered lattice by the interference of diffracted rays of unequal 
amplitude. 
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example, the ratio of the intensity of the superlattice 
lines to that of the main lines is only about 1:3500. 
In some cases special X-ray techniques can enhance 
this intensity ratio; one method is to use an X- 
ray wavelength near to the absorption edge when 
an anomalous depression of the /-factor occurs 
which is greater for one element than for the other. 
As a result, the difference between f A and / B is 
increased. A more general technique, however, is to 
use neutron diffraction since the scattering factors 
for neighbouring elements in the Periodic Table can 
be substantially different. Conversely, as Table 5.4 
indicates, neutron diffraction is unable to show the 
existence of superlattice lines in Cu 3 Au, because the 
scattering amplitudes of copper and gold for neutrons 
are approximately the same, although X-rays show 
them up quite clearly. 

Sharp superlattice lines are observed as long as 
order persists over lattice regions of about 10 -3 mm, 
large enough to give coherent X-ray reflections. When 
long-range order is not complete the superlattice lines 
become broadened, and an estimate of the domain 



Figure 6.15 Degree of order (x) and domain size (O) 
during isothermal annealing at 350 0 C after quenching from 
465° C (after Morris, Besag and Smallman, 1974; courtesy 
of Taylor and Francis ). 


size can be obtained from a measurement of the line 
breadth, as discussed in Chapter 5. Figure 6.15 shows 
variation of order S and domain size as determined 
from the intensity and breadth of powder diffraction 
lines. The domain sizes determined from the Scherrer 
line-broadening formula are in very good agreement 
with those observed by TEM. Short-range order is 
much more difficult to detect but nowadays direct 
measuring devices allow weak X-ray intensities to be 
measured more accurately, and as a result considerable 
information on the nature of short-range order has 
been obtained by studying the intensity of the diffuse 
background between the main lattice lines. 

High-resolution transmission microscopy of thin 
metal foils allows the structure of domains to be exam- 
ined directly. The alloy CuAu is of particular interest, 
since it has a face-centred tetragonal structure, often 
referred to as CuAu 1 below 380°C, but between 380°C 
and the disordering temperature of 410°C it has the 
CuAu 11 structures shown in Figure 6.16. The (002) 
planes are again alternately gold and copper, but half- 
way along the ( 2 -axis of the unit cell the copper atoms 
switch to gold planes and vice versa. The spacing 
between such periodic anti-phase domain boundaries 
is 5 unit cells or about 2 nm, so that the domains are 
easily resolvable in TEM, as seen in Figure 6.17a. The 
isolated domain boundaries in the simpler superlat- 
tice structures such as CuAu 1, although not in this 
case periodic, can also be revealed by electron micro- 
scope, and an example is shown in Figure 6.17b. Apart 
from static observations of these superlattice struc- 
tures, annealing experiments inside the microscope 
also allow the effect of temperature on the structure to 
be examined directly. Such observations have shown 
that the transition from CuAu 1 to CuAu 11 takes 
place, as predicted, by the nucleation and growth of 
anti-phase domains. 

6.6.3 Influence of ordering on properties 

Specific heat The order- disorder transformation has 
a marked effect on the specific heat, since energy 
is necessary to change atoms from one configuration 
to another. However, because the change in lattice 
arrangement takes place over a range of temperature, 
the specific heat versus temperature curve will be of the 
form shown in Figure 6.4b. In practice the excess spe- 
cific heat, above that given by Dulong and Petit’s law, 
does not fall sharply to zero at T c owing to the exis- 
tence of short-range order, which also requires extra 
energy to destroy it as the temperature is increased 
above T c . 
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Figure 6.16 One unit cell of the orthorhombic superlattice of CuAu, i.e. CuAu 11 (from J. Inst. Metals, 1958-9, courtesy of 
the Institute of Metals). 
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Figure 6.17 Electron micrographs of (a) CuAu 11 and 
(b) CuAu 1 ( from Pashley and Presland, 1958-9; courtesy 
of the Institute of Metals). 


contribution to the electrical resistance. Accordingly, 
superlattices below T c have a low electrical resistance, 
but on raising the temperature the resistivity increases, 
as shown in Figure 6.18a for ordered Cu 3 Au. The 
influence of order on resistivity is further demonstrated 
by the measurement of resistivity as a function of com- 
position in the copper- gold alloy system. As shown in 
Figure 6.18b, at composition near Cu 3 Au and CuAu, 
where ordering is most complete, the resistivity is 
extremely low, while away from these stoichiomet- 
ric compositions the resistivity increases; the quenched 
(disordered) alloys given by the dotted curve also have 
high resistivity values. 

Mechanical properties The mechanical properties 
are altered when ordering occurs. The change in yield 
stress is not directly related to the degree of ordering, 
however, and in fact Cu 3 Au crystals have a lower yield 
stress when well-ordered than when only partially- 
ordered. Experiments show that such effects can be 
accounted for if the maximum strength as a result of 
ordering is associated with critical domain size. In the 
alloy Cu 3 Au, the maximum yield strength is exhibited 
by quenched samples after an annealing treatment of 5 
min at 350°C which gives a domain size of 6 nm (see 
Figure 6.15). However, if the alloy is well-ordered and 
the domain size larger, the hardening is insignificant. In 
some alloys such as CuAu or CuPt, ordering produces 
a change of crystal structure and the resultant lattice 
strains can also lead to hardening. Thermal agitation 
is the most common means of destroying long-range 
order, but other methods (e.g. deformation) are equally 
effective. Figure 6.18c shows that cold work has a 
negligible effect upon the resistivity of the quenched 
(disordered) alloy but considerable influence on the 
well-annealed (ordered) alloy. Irradiation by neutrons 
or electrons also markedly affects the ordering (see 
Chapter 4). 


Electrical resistivity As discussed in Chapter 4, any 
form of disorder in a metallic structure (e.g. impuri- 
ties, dislocations or point defects) will make a large 


Magnetic properties The order- disorder pheno- 
menon is of considerable importance in the application 
of magnetic materials. The kind and degree of order 
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Figure 6.18 Effect of (a) temperature, (b) composition, and (c) deformation on the resistivity of copper-gold alloys (after 
Barrett, 1952; courtesy of McGraw-Hill). 
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affects the magnetic hardness, since small ordered 
regions in an otherwise disordered lattice induce 
strains which affect the mobility of magnetic domain 
boundaries (see Section 6.8.4). 

6.7 Electrical properties 

6.7.1 Electrical conductivity 

One of the most important electronic properties of met- 
als is the electrical conductivity, /c, and the reciprocal 
of the conductivity (known as the resistivity, p) is 
defined by the relation R = pi /A, where R is the resis- 
tance of the specimen, / is the length and A is the 
cross-sectional area. 

A characteristic feature of a metal is its high electri- 
cal conductivity which arises from the ease with which 
the electrons can migrate through the lattice. The high 
thermal conduction of metals also has a similar expla- 
nation, and the Wiedmann- Franz law shows that the 
ratio of the electrical and thermal conductivities is 
nearly the same for all metals at the same temperature. 

Since conductivity arises from the motion of con- 
duction electrons through the lattice, resistance must be 
caused by the scattering of electron waves by any kind 
of irregularity in the lattice arrangement. Irregularities 
can arise from any one of several sources, such as tem- 
perature, alloying, deformation or nuclear irradiation, 
since all will disturb, to some extent, the periodicity 
of the lattice. The effect of temperature is particularly 
important and, as shown in Figure 6.19, the resistance 
increases linearly with temperature above about 100 K 
up to the melting point. On melting, the resistance 
increases markedly because of the exceptional disor- 
der of the liquid state. However, for some metals such 
as bismuth, the resistance actually decreases, owing 
to the fact that the special zone structure which makes 



Figure 6.19 Variation of resistivity with temperature. 


bismuth a poor conductor in the solid state is destroyed 
on melting. 

In most metals the resistance approaches zero at 
absolute zero, but in some (e.g. lead, tin and mer- 
cury) the resistance suddenly drops to zero at some 
finite critical temperature above 0 K. Such metals are 
called superconductors. The critical temperature is dif- 
ferent for each metal but is always close to absolute 
zero; the highest critical temperature known for an ele- 
ment is 8 K for niobium. Superconductivity is now 
observed at much higher temperatures in some inter- 
metallic compounds and in some ceramic oxides (see 
Section 6.7.4). 

An explanation of electrical and magnetic properties 
requires a more detailed consideration of electronic 
structure than that briefly outlined in Chapter 1 . There 
the concept of band structure was introduced and the 
electron can be thought of as moving continuously 
through the structure with an energy depending on the 
energy level of the band it occupies. The wave-like 
properties of the electron were also mentioned. For the 
electrons the regular array of atoms on the metallic 
lattice can behave as a three-dimensional diffraction 
grating since the atoms are positively-charged and 
interact with moving electrons. At certain wavelengths, 
governed by the spacing of the atoms on the metallic 
lattice, the electrons will experience strong diffraction 
effects, the results of which are that electrons having 
energies corresponding to such wavelengths will be 
unable to move freely through the structure. As a 
consequence, in the bands of electrons, certain energy 
levels cannot be occupied and therefore there will be 
energy gaps in the otherwise effectively continuous 
energy spectrum within a band. 

The interaction of moving electrons with the metal 
ions distributed on a lattice depends on the wavelength 
of the electrons and the spacing of the ions in the 
direction of movement of the electrons. Since the ionic 
spacing will depend on the direction in the lattice, the 
wavelength of the electrons suffering diffraction by the 
ions will depend on their direction. The kinetic energy 
of a moving electron is a function of the wavelength 
according to the relationship 

E = h 2 /2mA 2 (6.14) 

Since we are concerned with electron energies, it is 
more convenient to discuss interaction effects in terms 
of the reciprocal of the wavelength. This quantity is 
called the wave number and is denoted by k. 

In describing electron -lattice interactions it is usual 
to make use of a vector diagram in which the direction 
of the vector is the direction of motion of the moving 
electron and its magnitude is the wave number of 
the electron. The vectors representing electrons having 
energies which, because of diffraction effects, cannot 
penetrate the lattice, trace out a three-dimensional 
surface known as a Brillouin zone. Figure 6.20a shows 
such a zone for a face-centred cubic lattice. It is made 
up of plane faces which are, in fact, parallel to the most 



182 Modern Physical Metallurgy and Materials Engineering 



widely-spaced planes in the lattice, i.e. in this case the 
{111} and {200} planes. This is a general feature of 
Brillouin zones in all lattices. 

For a given direction in the lattice, it is possible to 
consider the form of the electron energies as a function 
of wave number. The relationship between the two 
quantities as given from equation (6.14) is 

E = h 2 k 2 /2m (6.15) 

which leads to the parabolic relationship shown as a 
broken line in Figure 6.20b. Because of the existence 
of a Brillouin zone at a certain value of k , depending 
on the lattice direction, there exists a range of energy 
values which the electrons cannot assume. This pro- 
duces a distortion in the form of the E-k curve in the 
neighbourhood of the critical value of k and leads to 
the existence of a series of energy gaps, which cannot 
be occupied by electrons. The E-k curve showing this 
effect is given as a continuous line in Figure 6.20b. 

The existence of this distortion in the E-k curve, 
due to a Brillouin zone, is reflected in the density 
of states versus energy curve for the free electrons. 
As previously stated, the density of states -energy 
curve is parabolic in shape, but it departs from this 
form at energies for which Brillouin zone interactions 
occur. The result of such interactions is shown in 
Figure 6.21a in which the broken line represents the 
N(E)-E curve for free electrons in the absence of 
zone effects and the full line is the curve where a 
zone exists. The total number of electrons needed to 
fill the zone of electrons delineated by the full line 
in Figure 6.21a is 2 N, where N is the total number 
of atoms in the metal. Thus, a Brillouin zone would 
be filled if the metal atoms each contributed two 
electrons to the band. If the metal atoms contribute 
more than two per atom, the excess electrons must be 
accommodated in the second or higher zones. 





Figure 6.21 Schematic representation of Brillouin zones. 


In Figure 6.21a the two zones are separated by an 
energy gap, but in real metals this is not necessarily 
the case, and two zones can overlap in energy in the 
N(E)-E curves so that no such energy gaps appear. 
This overlap arises from the fact that the energy of 
the forbidden region varies with direction in the lattice 
and often the energy level at the top of the first zone 
has a higher value in one direction than the lowest 
energy level at the bottom of the next zone in some 
other direction. The energy gap in the N(E)-E curves, 
which represent the summation of electronic levels in 
all directions, is then closed (Figure 6.21b). 

For electrical conduction to occur, it is necessary 
that the electrons at the top of a band should be 
able to increase their energy when an electric field is 
applied to materials so that a net flow of electrons in 
the direction of the applied potential, which manifests 
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itself as an electric current, can take place. If an 
energy gap between two zones of the type shown 
in Figure 6.21a occurs, and if the lower zone is just 
filled with electrons, then it is impossible for any 
electrons to increase their energy by jumping into 
vacant levels under the influence of an applied electric 
field, unless the field strength is sufficiently great to 
supply the electrons at the top of the filled band with 
enough energy to jump the energy gap. Thus metallic 
conduction is due to the fact that in metals the number 
of electrons per atom is insufficient to fill the band up 
to the point where an energy gap occurs. In copper, for 
example, the 4s valency electrons fill only one half of 
the outer s-band. In other metals (e.g. Mg) the valency 
band overlaps a higher energy band and the electrons 
near the Fermi level are thus free to move into the 
empty states of a higher band. When the valency band 
is completely filled and the next higher band, separated 
by an energy gap, is completely empty, the material is 
either an insulator or a semiconductor. If the gap is 
several electron volts wide, such as in diamond where 
it is 7 eV, extremely high electric fields would be 
necessary to raise electrons to the higher band and the 
material is an insulator. If the gap is small enough, 
such as 1-2 eV as in silicon, then thermal energy 
may be sufficient to excite some electrons into the 
higher band and also create vacancies in the valency 
band, the material is a semiconductor. In general, the 
lowest energy band which is not completely filled with 
electrons is called a conduction band, and the band 
containing the valency electrons the valency band. For 
a conductor the valency band is also the conduction 
band. The electronic state of a selection of materials 
of different valencies is presented in Figure 6.21c. 
Although all metals are relatively good conductors of 
electricity, they exhibit among themselves a range 
of values for their resistivities. There are a number of 
reasons for this variability. The resistivity of a metal 
depends on the density of states of the most energetic 
electrons at the top of the band, and the shape of the 
N(E)-E curve at this point. 

In the transition metals, for example, apart from pro- 
ducing the strong magnetic properties, great strength 
and high melting point, the d - band is also responsi- 
ble for the poor electrical conductivity and high elec- 
tronic specific heat. When an electron is scattered by 
a lattice irregularity it jumps into a different quan- 
tum state, and it will be evident that the more vacant 
quantum states there are available in the same energy 
range, the more likely will be the electron to deflect 
at the irregularity. The high resistivities of the transi- 
tion metals may, therefore, be explained by the ease 
with which electrons can be deflected into vacant d- 
states. Phonon-assisted s-d scattering gives rise to the 
non-linear variation of p with temperature observed at 
high temperatures. The high electronic specific heat is 
also due to the high density of states in the unfilled d- 
band, since this gives rise to a considerable number of 
electrons at the top of the Fermi distribution which can 
be excited by thermal activation. In copper, of course, 


there are no unfilled levels at the top of the d - band 
into which electrons can go, and consequently both 
the electronic specific heat and electrical resistance is 
low. The conductivity also depends on the degree to 
which the electrons are scattered by the ions of the 
metal which are thermally vibrating, and by impurity 
atoms or other defects present in the metal. 

Insulators can also be modified either by the applica- 
tion of high temperatures or by the addition of impu- 
rities. Clearly, insulators may become conductors at 
elevated temperatures if the thermal agitation is suffi- 
cient to enable electrons to jump the energy gap into 
the unfilled zone above. 

6.7.2 Semiconductors 

Some materials have an energy gap small enough 
to be surmounted by thermal excitation. In such 
intrinsic semiconductors, as they are called, the current 
carriers are electrons in the conduction band and 
holes in the valency band in equal numbers. The 
relative position of the two bands is as shown in 
Figure 6.22. The motion of a hole in the valency 
band is equivalent to the motion of an electron in 
the opposite direction. Alternatively, conduction may 
be produced by the presence of impurities which 
either add a few electrons to an empty zone or 
remove a few from a full one. Materials which 
have their conductivity developed in this way are 
commonly known as semiconductors. Silicon and 
germanium containing small amounts of impurity have 
semiconducting properties at ambient temperatures 
and, as a consequence, they are frequently used in 
electronic transistor devices. Silicon normally has 
completely filled zones, but becomes conducting if 
some of the silicon atoms, which have four valency 
electrons, are replaced by phosphorus, arsenic or 
antimony atoms which have five valency electrons. 
The extra electrons go into empty zones, and as a 
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Figure 6.22 Schematic diagram of an intrinsic 
semiconductor showing the relative positions of the 
conduction and valency bands. 
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result silicon becomes an ft -type semiconductor, since 
conduction occurs by negative carriers. On the other 
hand, the addition of elements of lower valency than 
silicon, such as aluminium, removes electrons from 
the filled zones leaving behind ‘holes’ in the valency 
band structure. In this case silicon becomes a /7-type 
semiconductor, since the movement of electrons in one 
direction of the zone is accompanied by a movement 
of ‘holes’ in the other, and consequently they act 
as if they were positive carriers. The conductivity 
may be expressed as the product of (1) the number 
of charge carriers, ft, (2) the charge carried by each 
(i.e. e — 1.6 x 10“ 19 C) and (3) the mobility of the 
carrier, /x. 

A pentavalent impurity which donates conduction 
electrons without producing holes in the valency band 
is called a donor. The spare electrons of the impurity 
atoms are bound in the vicinity of the impurity atoms 
in energy levels known as the donor levels, which 
are near the conduction band. If the impurity exists 
in an otherwise intrinsic semiconductor the number of 
electrons in the conduction band become greater than 
the number of holes in the valency band and, hence, 
the electrons are the majority carriers and the holes the 
minority carriers. Such a material is an ft -type extrinsic 
semiconductor (see Figure 6.23a). 
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Figure 6.23 Schematic energy band structure of (a) n-type 
and (b) p-type semiconductor. 


Trivalent impurities in Si or Ge show the opposite 
behaviour leaving an empty electron state, or hole, 
in the valency band. If the hole separates from the 
so-called acceptor atom an electron is excited from 
the valency band to an acceptor level A E ~ 0.01 eV. 
Thus, with impurity elements such as Al, Ga or In 
creating holes in the valency band in addition to those 
created thermally, the majority carriers are holes and 
the semiconductor is of the /7-type extrinsic form 
(see Figure 6.23b). For a semiconductor where both 
electrons and holes carry current the conductivity is 
given by 

k = n t epL & + fth^Mh (6.16) 

where n e and ft h are, respectively, the volume con- 
centration of electrons and holes, and /x e and /x h the 
mobilities of the carriers, i.e. electrons and holes. 

Semiconductor materials are extensively used in 
electronic devices such as the p-n rectifying junction, 
transistor (a double-junction device) and the tunnel 
diode. Semiconductor regions of either p- or ft -type 
can be produced by carefully controlling the distribu- 
tion and impurity content of Si or Ge single crystals, 
and the boundary between p- and ft -type extrinsic 
semiconductor materials is called a p-n junction. Such 
a junction conducts a large current when the voltage is 
applied in one direction, but only a very small cur- 
rent when the voltage is reversed. The action of a 
p-n junction as a rectifier is shown schematically in 
Figure 6.24. The junction presents no barrier to the 
flow of minority carriers from either side, but since the 
concentration of minority carriers is low, it is the flow 
of majority carriers which must be considered. When 
the junction is biased in the forward direction, i.e. n- 
type made negative and the /7-type positive, the energy 
barrier opposing the flow of majority carriers from both 
sides of the junction is reduced. Excess majority car- 
riers enter the p and n regions, and these recombine 
continuously at or near the junction to allow large cur- 
rents to flow. When the junction is reverse-biased, the 
energy barrier opposing the flow of majority carriers 
is raised, few carriers move and little current flows. 

A transistor is essentially a single crystal with two 
p-n junctions arranged back to back to give either a 
p-n-p or n-p-n two-junction device. For a p-n-p 
device the main current flow is provided by the positive 
holes, while for a n-p-n device the electrons carry 
the current. Connections are made to the individual 
regions of the p-n-p device, designated emitter, base 
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Figure 6.24 Schematic illustration of p-n junction rectification with (a) forward bias and (b) reverse bias. 
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Figure 6.25 Schematic diagram of a p-n-p transistor. 


and collector respectively, as shown in Figure 6.25, 
and the base is made slightly negative and the collector 
more negative relative to the emitter. The emitter- 
base junction is therefore forward-biased and a strong 
current of holes passes through the junction into the 
n -layer which, because it is thin (10 -2 mm), largely 
reach the collector base junction without recombining 
with electrons. The collector-base junction is reverse- 
biased and the junction is no barrier to the passage of 
holes; the current through the second junction is thus 
controlled by the current through the first junction. 
A small increase in voltage across the emitter-base 
junction produces a large injection of holes into the 
base and a large increase in current in the collector, to 
give the amplifying action of the transistor. 

Many varied semiconductor materials such as InSb 
and GaAs have been developed apart from Si and Ge. 
However, in all cases very high purity and crystal 
perfection is necessary for efficient semiconducting 
operations and to produce the material, zone -refining 
techniques are used. Semiconductor integrated circuits 
are extensively used in micro-electronic equipment 
and these are produced by vapour deposition through 
masks on to a single Si- slice, followed by diffusion of 
the deposits into the base crystal. 

Doped ceramic materials are used in the construc- 
tion of thermistors , which are semiconductor devices 
with a marked dependence of electrical resistivity upon 
temperature. The change in resistance can be quite 
significant at the critical temperature. Positive temper- 
ature coefficient (PTC) thermistors are used as switch- 
ing devices, operating when a control temperature is 
reached during a heating process. PTC thermistors are 
commonly based on barium titanate. Conversely, NTC 
thermistors are based on oxide ceramics and can be 
used to signal a desired temperature change during 
cooling; the change in resistance is much more gradual 
and does not have the step-characteristic of the PTC 
types. 

Doped zinc oxide does not exhibit the linear volt- 
age/current relation that one expects from Ohm’s Law. 
At low voltage, the resistivity is high and only a small 
current flows. When the voltage increases there is a 
sudden decrease in resistance, allowing a heavier cur- 
rent to flow. This principle is adopted in the varistor , 
a voltage- sensitive on/off switch. It is wired in parallel 


with high-voltage equipment and can protect it from 
transient voltage ‘spikes’ or overload. 

6.7.3 Superconductivity 

At low temperatures (<20 K) some metals have zero 
electrical resistivity and become superconductors. This 
superconductivity disappears if the temperature of 
the metal is raised above a critical temperature T c , 
if a sufficiently strong magnetic held is applied or 
when a high current density hows. The critical held 
strength H c , current density J c and temperature T c are 
interdependent. Figure 6.26 shows the dependence of 
H c on temperature for a number of metals; metals with 
high T c and H c values, which include the transition 
elements, are known as hard superconductors, those 
with low values such as Al, Zn, Cd, Hg, white-Sn are 
soft superconductors. The curves are roughly parabolic 
and approximate to the relation H c = H 0 [1 — ( T/T c ) 2 ] 
where H 0 is the critical held at 0 K; H 0 is about 
1.6 x 10 5 A/m for Nb. 

Superconductivity arises from conduction elec- 
tron-electron attraction resulting from a distortion of 
the lattice through which the electrons are travelling; 
this is clearly a weak interaction since for most metals 
it is destroyed by thermal activation at very low tem- 
peratures. As the electron moves through the lattice 
it attracts nearby positive ions thereby locally caus- 
ing a slightly higher positive charge density. A nearby 
electron may in turn be attracted by the net positive 
charge, the magnitude of the attraction depending on 
the electron density, ionic charge and lattice vibrational 
frequencies such that under favourable conditions the 
effect is slightly stronger than the electrostatic repul- 
sion between electrons. The importance of the lattice 
ions in superconductivity is supported by the obser- 
vation that different isotopes of the same metal (e.g. 
Sn and Hg) have different T c values proportional to 
M -1/2 , where M is the atomic mass of the isotope. 
Since both the frequency of atomic vibrations and 
the velocity of elastic waves also varies as M -1/2 , 
the interaction between electrons and lattice vibrations 



Figure 6.26 Variation of critical field H c as a function of 
temperature for several pure metal superconductors. 
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(i.e. electron -phonon interaction) must be at least one 
cause of superconductivity. 

The theory of superconductivity indicates that the 
electron -electron attraction is strongest between elec- 
trons in pairs, such that the resultant momentum of 
each pair is exactly the same and the individual elec- 
trons of each pair have opposite spin. With this partic- 
ular form of ordering the total electron energy (i.e. 
kinetic and interaction) is lowered and effectively 
introduces a finite energy gap between this organized 
state and the usual more excited state of motion. The 
gap corresponds to a thin shell at the Fermi surface, 
but does not produce an insulator or semiconductor, 
because the application of an electric field causes the 
whole Fermi distribution, together with gap, to drift 
to an unsymmetrical position, so causing a current to 
flow. This current remains even when the electric field 
is removed, since the scattering which is necessary to 
alter the displaced Fermi distribution is suppressed. 

At 0 K all the electrons are in paired states but as 
the temperature is raised, pairs are broken by thermal 
activation giving rise to a number of normal electrons 
in equilibrium with the superconducting pairs. With 
increasing temperature the number of broken pairs 
increases until at T c they are finally eliminated together 
with the energy gap; the superconducting state then 
reverts to the normal conducting state. The supercon- 
ductivity transition is a second-order transformation 
and a plot of C/T as a function of T 2 deviates from 
the linear behaviour exhibited by normal conducting 
metals, the electronic contribution being zero at 0 K. 
The main theory of superconductivity, due to Bardeen, 
Cooper and Schrieffer (BCS) attempts to relate T c to 
the strength of the interaction potential, the density 
of states at the Fermi surface and to the average fre- 
quency of lattice vibration involved in the scattering, 
and provides some explanation for the variation of T c 
with the e/a ratio for a wide range of alloys, as shown 
in Figure 6.27. The main effect is attributable to the 


Empty Partly filled 

d-shell" d-shell 


Closed 

rf-shell 



Column of periodic table 

0 la 2a 3b 4 b 5b 6b 7b 8b 9b 10b 1b 2b 3a 4a 5a 6a 7a 0 


0 24 6 8 10 2 4 6 8 

Column number (averaged for compounds and alloys) 


Figure 6.27 The variation of T c with position in the 
periodic table (from Mathias, 1959, p. 138; courtesy of 
North-Holland Publishing Co.). 


change in density of states with e/a ratio. Supercon- 
ductivity is thus favoured in compounds of polyvalent 
atoms with crystal structures having a high density of 
states at the Fermi surface. Compounds with high T c 
values, such as Nb 3 Sn (18.1 K), Nb 3 Al (17.5 K), V 3 Si 
(17.0 K), V 3 Ga (16.8 K), all crystallize with the /3- 
tungsten structure and have an e/a ratio close to 4.7; 
T c is very sensitive to the degree of order and to devi- 
ation from the stoichiometric ratio, so values probably 
correspond to the non- stoichiometric condition. 

The magnetic behaviour of superconductivity is as 
remarkable as the corresponding electrical behaviour, 
as shown in Figure 6.28 by the Meissner effect for 
an ideal (structurally perfect) superconductor. It is 
observed for a specimen placed in a magnetic field 
(H < H c ), which is then cooled down below T c , that 
magnetic lines of force are pushed out. The specimen 
is a perfect diamagnetic material with zero inductance 
as well as zero resistance. Such a material is termed 
an ideal type I superconductor. An ideal type II super- 
conductor behaves similarly at low field strengths, with 
H < H c i < H c , but then allows a gradual penetration 
of the field returning to the normal state when pen- 
etration is complete at H > H c 2 > H c . In detail, the 
field actually penetrates to a small extent in type I 
superconductors when it is below H c and in type II 
superconductors when H is below H c 3 , and decays 
away at a penetration depth ~ 100- 10 nm. 

The observation of the Meissner effect in type I 
superconductors implies that the surface between the 
normal and superconducting phases has an effective 
positive energy. In the absence of this surface energy, 
the specimen would break up into separate fine regions 
of superconducting and normal material to reduce the 
work done in the expulsion of the magnetic flux. A 
negative surface energy exists between the normal 
and superconducting phases in a type II superconduc- 
tor and hence the superconductor exists naturally in 
a state of finely-separated superconducting and nor- 
mal regions. By adopting a ‘mixed state’ of normal 
and superconducting regions the volume of interface is 
maximized while at the same time keeping the volume 



Figure 6.28 The Meissner effect; shown by the expulsion of 
magnetic flux when the specimen becomes superconducting. 
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of normal conduction as small as possible. The struc- 
ture of the mixed state is believed to consist of lines 
of normal phases parallel to the applied field through 
which the field lines run, embedded in a supercon- 
ducting matrix. The field falls off with distances from 
the centre of each line over the characteristic distance 
X, and vortices or whirlpools of supercurrents flow 
around each line; the flux line, together with its cur- 
rent vortex, is called a fluxoid. At H c i, fluxoids appear 
in the specimen and increase in number as the mag- 
netic field is raised. At H c 2 , the fluxoids completely fill 
the cross-section of the sample and type II supercon- 
ductivity disappears. Type II superconductors are of 
particular interest because of their high critical fields 
which makes them potentially useful for the construc- 
tion of high-field electromagnetics and solenoids. To 
produce a magnetic field of ~10 T with a conven- 
tional solenoid would cost more than ten times that of 
a superconducting solenoid wound with Nb 3 Sn wire. 
By embedding Nb wire in a bronze matrix it is pos- 
sible to form channels of Nb 3 Sn by interdiffusion. 
The conventional installation would require consid- 
erable power, cooling water and space, whereas the 
superconducting solenoid occupies little space, has no 
steady- state power consumption and uses relatively 
little liquid helium. It is necessary, however, for the 
material to carry useful currents without resistance 
in such high fields, which is not usually the case in 
annealed homogeneous type II superconductors. For- 
tunately, the critical current density is extremely sen- 
sitive to micro structure and is markedly increased by 
precipitation-hardening, cold work, radiation damage, 
etc., because the lattice defects introduced pin the flux- 
oids and tend to immobilize them. Figure 6.29 shows 
the influence of metallurgical treatment on the critical 
current density. 
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Figure 6.29 The effect of processing on the J c versus H 
curve of an Nh-25% Zr alloy wire which produces a fine 
precipitate and raises J c ( from Rose, Shepard and Wulff, 
1966; courtesy of John Wiley and Sons). 


6.7.4 Oxide superconductors 

In 1986 a new class of ‘warm’ superconductors, based 
on mixed ceramic oxides, was discovered by J. G. 
Bednorz and K. A. Muller. These lanthanum- copper 
oxide superconductors had a T c around 35 K, well 
above liquid hydrogen temperature. Since then, three 
mixed oxide families have been developed with much 
higher T c values, all around 100 K. Such materials 
give rise to optimism for superconductor technology; 
first, in the use of liquid nitrogen rather than liquid 
hydrogen and second, in the prospect of producing a 
room temperature superconductor. 

The first oxide family was developed by mixing 
and heating the three oxides Y 2 0 3 , BaO and CuO. 
This gives rise to the mixed oxide YBa2Cu 3 0 7 _ x , 
sometimes referred to as 1-2-3 compound or YBCO. 
The structure is shown in Figure 6.30 and is basically 
made by stacking three perovskite-type unit cells one 
above the other; the top and bottom cells have barium 
ions at the centre and copper ions at the corners, the 
middle cell has yttrium at the centre. Oxygen ions sit 
half-way along the cell edges but planes, other than 
those containing barium, have some missing oxygen 
ions (i.e. vacancies denoted by x in the oxide formula). 
This structure therefore has planes of copper and 
oxygen ions containing vacancies, and copper- oxygen 
ion chains perpendicular to them. YBCO has a T c 
value of about 90 K which is virtually unchanged 
when yttrium is replaced by other rare earth elements. 
The second family of oxides are Bi-Ca-Sr-Cu-O x 
materials with the metal ions in the ratio of 2111, 
2122 or 2223, respectively. The 2111 oxide has only 
one copper- oxygen layer between the bismuth-oxygen 
layers, the 2122 two and the 2223 three giving rise to 
an increasing T c up to about 105 K. The third family 
is based on Tl-Ca-Ba-Cu-0 with a 2223 structure 
having three copper- oxygen layers and a T c of about 
125 K. 

While these oxide superconductors have high T c 
values and high critical magnetic field (// c )-values, 
they unfortunately have very low values of / c , the 
critical current density. A high J c is required if they 
are to be used for powerful superconducting magnets. 
Electrical applications are therefore unlikely until the 
J c value can be raised by several orders of magni- 
tude comparable to those of conventional supercon- 
ductors, i.e. 10 6 A cm -2 . The reason for the low J c 
is thought to be largely due to the grain boundaries 
in polycrystalline materials, together with dislocations, 
voids and impurity particles. Single crystals show J c 
values around 10 5 A cm -2 and textured materials, pro- 
duced by melt growth techniques, about 10 4 A cm -2 , 
but both processes have limited commercial applica- 
tion. Electronic applications appear to be more promis- 
ing since it is in the area of thin (1 pm) films that 
high J c values have been obtained. By careful deposi- 
tion control, epitaxial and single-crystal films having 
J c ^ 10 6 A cm -2 with low magnetic field dependence 
have been produced. 
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Figure 6.30 Structure of 1-2-3 compound; the unit cell of the 90 K superconducting perovskite, YBa 2 Cu 3 0y- x , where 
x ~ 0 (by courtesy of P. J. Hirst, Superconductivity Research Group, University of Birmingham, UK). 


6.8 Magnetic properties 

6.8.1 Magnetic susceptibility 

When a metal is placed in a magnetic field of strength 
H , the field induced in the metal is given by 

B = H + 4ttI (6.17) 

where I is the intensity of magnetization. The quantity 
I is a characteristic property of the metal, and is related 
to the susceptibility per unit volume of the metal which 
is defined as 

k = I/H (6.18) 

The susceptibility is usually measured by a method 
which depends upon the fact that when a metal 
specimen is suspended in a non-uniform transverse 
magnetic field, a force proportional to kV.H AH / dx, 
where V is the volume of the specimen and d H /dx 
is the field gradient measured transversely to the lines 
of force, is exerted upon it. This force is easily mea- 
sured by attaching the specimen to a sensitive bal- 
ance, and one type commonly used is that designed 
by Sucksmith. In this balance the distortion of a 
copper-beryllium ring, caused by the force on the 


specimen, is measured by means of an optical or 
electro-mechanical system. Those metals for which k is 
negative, such as copper, silver, gold and bismuth, are 
repelled by the field and are termed diamagnetic mate- 
rials. Most metals, however, have positive k values 
(i.e. they are attracted by the field) and are either para- 
magnetic (when k is small) or ferromagnetic (when k 
is very large). Only four pure metals -iron, cobalt and 
nickel from the transition series, and gadolinium from 
the rare earth series -are ferromagnetic (k ~ 1000) 
at room temperature, but there are several ferromag- 
netic alloys and some contain no metals which are 
themselves ferromagnetic. The Heusler alloy, which 
contains manganese, copper and aluminium, is one 
example; ferromagnetism is due to the presence of one 
of the transition metals. 

The ability of a ferromagnetic metal to concen- 
trate the lines of force of the applied field is of great 
practical importance, and while all such materials can 
be both magnetized and demagnetized, the ease with 
which this can be achieved usually governs their appli- 
cation in the various branches of engineering. Materi- 
als may be generally classified either as magnetically 
soft (temporary magnets) or as magnetically hard (per- 
manent magnets), and the difference between the two 
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Figure 6.31 B-H curves for (a) soft and (b) hard magnets. 


types of magnet may be inferred from Figure 6.31. 
Here, H is the magnetic field necessary to induce a 
field of strength B inside the material. Upon removal 
of the field H , a certain residual magnetism Z? r , known 
as the remanence residual, is left in the specimen, and 
a field H c , called the coercive force, must be applied 
in the opposite direction to remove it. A soft magnet 
is one which is easy both to magnetize and to demag- 
netize and, as shown in Figure 6.31a, a low value of 
H is sufficient to induce a large field B in the metal, 
while only a small field H c is required to remove it; 
a hard magnet is a material that is magnetized and 
demagnetized with difficulty (Figure 6.31b). 

6.8.2 Diamagnetism and paramagnetism 

Diagmagnetism is a universal property of the atom 
since it arises from the motion of electrons in their 
orbits around the nucleus. Electrons moving in this 
way represent electrical circuits and it follows from 
Lenz’s law that this motion is altered by an applied 
field in such a manner as to set up a repulsive force. 
The diamagnetic contribution from the valency elec- 
trons is small, but from a closed shell it is proportional 
to the number of electrons in it and to the square of the 
radius of the ‘orbit’. In many metals this diamagnetic 
effect is outweighed by a paramagnetic contribution, 
the origin of which is to be found in the electron 
spin. Each electron behaves like a small magnet and 
in a magnetic field can take up one of two orienta- 
tions, either along the field or in the other opposite 
direction, depending on the direction of the electron 
spin. Accordingly, the energy of the electron is either 
decreased or increased and may be represented conve- 
niently by the band theory. Thus, if we regard the band 
of energy levels as split into two halves (Figure 6.32a), 
each half associated with electrons of opposite spin, it 
follows that in the presence of the field, some of the 
electrons will transfer their allegiance from one band 
to the other until the Fermi energy level is the same in 
both. It is clear, therefore, that in this state there will be 
a larger number of electrons which have their energy 


lowered by the field than have their energy raised. This 
condition defines paramagnetism, since there will be an 
excess of unpaired spins which give rise to a resultant 
magnetic moment. 

It is evident that an insulator will not be paramag- 
netic since the bands are full and the lowered half-band 
cannot accommodate those electrons which wish to 
‘spill over’ from the raised half-band. On the other 
hand, it is not true, as one might expect, that conduc- 
tors are always paramagnetic. This follows because 
in some elements the natural diamagnetic contribution 
outweighs the paramagnetic contribution; in copper, 
for example, the newly filled d-shell gives rise to a 
larger diamagnetic contribution. 

6.8.3 Ferromagnetism 

The theory of ferromagnetism is difficult and at present 
not completely understood. Nevertheless, from the 
electron theory of metals it is possible to build up 
a band picture of ferromagnetic materials which goes 
a long way to explain not only their ferromagnetic 
properties but also the associated high resistivity and 
electronic specific heat of these metals compared to 
copper. In recent years considerable experimental work 
has been done on the electronic behaviour of the tran- 
sition elements, and this suggests that the electronic 
structure of iron is somewhat different to that of cobalt 
and nickel. 

Ferromagnetism, like paramagnetism, has its ori- 
gin in the electron spin. In ferromagnetic materials, 
however, permanent magnetism is obtained and this 
indicates that there is a tendency for electron spins to 
remain aligned in one direction even when the field 
has been removed. In terms of the band structure this 
means that the half-band associated with one spin is 
automatically lowered when the vacant levels at its 
top are filled by electrons from the top of the other 
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Figure 6.32 Schematic representation of (a) paramagnetic 
nickel and (b) ferromagnetic nickel (after Raynor, 1958; by 
courtesy of Inst, of Materials). 
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(Figure 6.32b); the change in potential energy asso- 
ciated with this transfer is known as the exchange 
energy. Thus, while it is energetically favourable for 
a condition in which all the spins are in the same 
direction, an opposing factor is the Pauli exclusion 
principle, because if the spins are aligned in a single 
direction many of the electrons will have to go into 
higher quantum states with a resultant increase in 
kinetic energy. In consequence, the conditions for 
ferro-magnetism are stringent, and only electrons from 
partially filled d or / levels can take part. This con- 
dition arises because only these levels have (1) vacant 
levels available for occupation, and (2) a high density 
of states which is necessary if the increase in kinetic 
energy accompanying the alignment of spins is to be 
smaller than the decrease in exchange energy. Both of 
these conditions are fulfilled in the transition and rare- 
earth metals, but of all the metals in the long periods 
only the elements iron, cobalt and nickel are ferromag- 
netic at room temperature, gadolinium just above RT 
(T c ~ 16°C) and the majority are in fact strongly para- 
magnetic. This observation has led to the conclusion 
that the exchange interactions are most favourable, at 
least for the iron group of metals, when the ratio of 
the atomic radius to the radius of the unfilled shell, i.e. 
the d-shell, is somewhat greater than 3 (see Table 6.1). 
As a result of this condition it is hardly surprising that 
there are a relatively large number of ferromagnetic 
alloys and compounds, even though the base elements 
themselves are not ferromagnetic. 

In ferromagnetic metals the strong interaction results 
in the electron spins being spontaneously aligned, even 
in the absence of an applied field. However, a specimen 
of iron can exist in an unmagnetized condition because 
such an alignment is limited to small regions, or 
domains, which statistically oppose each other. These 
domains are distinct from the grains of a polycrys- 
talline metal and in general there are many domains 
in a single grain, as shown in Figure 6.33. Under 
the application of a magnetic field the favourably- 
oriented domains grow at the expense of the others 
by the migration of the domain boundaries until the 
whole specimen appears fully magnetized. At high 


Table 6.1 Radii (nm) of electronic orbits of atoms of tran- 
sition metals of first long period (after Slater, Quantum 
Theory of Matter ) 


Element 

3d 

4 s 

Atomic radius 
in metal (nm) 

Sc 

0.061 

0.180 

0.160 

Ti 

0.055 

0.166 

0.147 

V 

0.049 

0.152 

0.136 

Cr 

0.045 

0.141 

0.128 

Mn 

0.042 

0.131 

0.128 

Fe 

0.039 

0.122 

0.128 

Co 

0.036 

0.114 

0.125 

Ni 

0.034 

0.107 

0.125 

Cu 

0.032 

0.103 

0.128 



Figure 6.33 Simple domain structure in a ferromagnetic 
material. The arrows indicate the direction of magnetization 
in the domains. 

field strengths it is also possible for unfavourably- 
oriented domains to ‘snap-over’ into more favourable 
orientations quite suddenly, and this process, which 
can often be heard using sensitive equipment, is known 
as the Barkhausen effect. 

The state in which all the electron spins are in com- 
plete alignment is possible only at low temperatures. 
As the temperature is raised the saturation magnetiza- 
tion is reduced, falling slowly at first and then increas- 
ingly rapidly, until a critical temperature, known as 
the Curie temperature, is reached. Above this temper- 
ature, T c , the specimen is no longer ferromagnetic, but 
becomes paramagnetic, and for the metals iron, cobalt, 
and nickel this transition occurs at 780°C, 1075°C 
and 365°C. respectively. Such a cooperative process 
may be readily understood from thermodynamic rea- 
soning, since the additional entropy associated with 
the disorder of the electron spins makes the disordered 
(paramagnetic) state thermodynamically more stable at 
high temperatures. This behaviour is similar to that 
shown by materials which undergo the order -disorder 
transformation and, as a consequence, ferromagnetic 
metals exhibit a specific heat peak of the form previ- 
ously shown (see Figure 6.4b). 

A ferromagnetic crystal in its natural state has a 
domain structure. From Figure 6.33 it is clear that 
by dividing itself into domains the crystal is able to 
eliminate those magnetic poles which would otherwise 
occur at the surface. The width of the domain boundary 
or Bloch wall is not necessarily small, however, and 
in most materials is of the order of 100 atoms in 
thickness. By having a wide boundary the electron 
spins in neighbouring atoms are more nearly parallel, 
which is a condition required to minimize the exchange 
energy. On the other hand, within any one domain 
the direction of magnetization is parallel to a direction 
of easy magnetization (i.e. (100) in iron, (111) in 
nickel and (001) in cobalt) and as one passes across a 
boundary the direction of magnetization rotates away 
from one direction of easy magnetization to another. 
To minimize this magnetically-disturbed region the 
crystal will try to adopt a boundary which is as thin as 
possible. Consequently, the boundary width adopted is 
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one of compromise between the two opposing effects, 
and the material may be considered to possess a 
magnetic interfacial or surface energy. 

6.8.4 Magnetic alloys 

The work done in moving a domain boundary depends 
on the energy of the boundary, which in turn depends 
on the magnetic anisotropy. The ease of magnetiza- 
tion also depends on the state of internal strain in the 
material and the presence of impurities. Both these 
latter factors affect the magnetic ‘hardness’ through 
the phenomenon of magnetostriction, i.e. the lattice 
constants are slightly altered by the magnetization so 
that a directive influence is put upon the orientation of 
magnetization of the domains. Materials with inter- 
nal stresses are hard to magnetize or demagnetize, 
while materials free from stresses are magnetically 
soft. Hence, since internal stresses are also responsible 
for mechanical hardness, the principle which governs 
the design of magnetic alloys is to make permanent 
magnetic materials as mechanically hard and soft mag- 
nets as mechanically soft as possible. 

Magnetically soft materials are used for trans- 
former laminations and armature stampings where a 
high permeability and a low hysteresis are desirable: 
iron -silicon or iron -nickel alloys are commonly used 
for this purpose. In the development of magnetically 
soft materials it is found that those elements which 
form interstitial solid solutions with iron are those 
which broaden the hysteresis loop most markedly. For 
this reason, it is common to remove such impurities 
from transformer iron by vacuum melting or hydrogen 
annealing. However, such processes are expensive and, 
consequently, alloys are frequently used as ‘soft’ mag- 
nets, particularly iron- silicon and iron-nickel alloys 
(because silicon and nickel both reduce the amount 
of carbon in solution). The role of Si is to form a 
y-loop and hence remove transformation strains and 
also improve orientation control. In the production of 
iron -silicon alloys the factors which are controlled 
include the grain size, the orientation difference from 
one grain to the next, and the presence of non-magnetic 
inclusions, since all are major sources of coercive 
force. The coercive force increases with decreasing 
grain size because the domain pattern in the neigh- 
bourhood of a grain boundary is complicated owing to 
the orientation difference between two adjacent grains. 
Complex domain patterns can also arise at the free 
surface of the metal unless these are parallel to a direc- 
tion of easy magnetization. Accordingly, to minimize 
the coercive force, rolling and annealing schedules are 
adopted to produce a preferred oriented material with 
a strong ‘cube-texture’, i.e. one with two (100) direc- 
tions in the plane of the sheet (see Chapter 7). This 
procedure is extremely important, since transformer 
material is used in the form of thin sheets to mini- 
mize eddy-current losses. Fe-Si-B in the amorphous 
state is finding increasing application in transformers. 


The iron-nickel series, Permalloys , present many 
interesting alloys and are used chiefly in communica- 
tion engineering where a high permeability is a neces- 
sary condition. The alloys in the range 40-55% nickel 
are characterized by a high permeability and at low 
field strengths this may be as high as 15 000 compared 
with 500 for annealed iron. The 50% alloy, Hyper- 
nik , may have a permeability which reaches a value 
of 70 000, but the highest initial and maximum perme- 
ability occurs in the composition range of the FeNi 3 
superlattice, provided the ordering phenomenon is sup- 
pressed. An interesting development in this field is in 
the heat treatment of the alloys while in a strong mag- 
netic field. By such a treatment the permeability of 
Permalloy 65 has been increased to about 260000. 
This effect is thought to be due to the fact that dur- 
ing alignment of the domains, plastic deformation is 
possible and magneto strictive strains may be relieved. 

Magnetically hard materials are used for applica- 
tions where a ‘permanent magnetic field is required, 
but where electromagnets cannot be used, such as 
in electric clocks, meters, etc. Materials commonly 
used for this purpose include Alnico (Al-Ni-Co) 
alloys, Cunico (Cu-Ni-Co) alloys, ferrites (barium 
and strontium), samarium- cobalt alloys (SmCo 5 and 
Sm 2 (Co, Fe, Cu, Zr) 17 ) and Neomax (Nd 2 Fei 4 B). The 
Alnico alloys have high remanence but poor coerciv- 
ities, the ferrites have rather low remanence but good 
coercivities together with very cheap raw material 
costs. The rare-earth magnets have a high performance 
but are rather costly although the Nd-based alloys are 
cheaper than the Sm-based ones. 

In the development of magnetically hard materials, 
the principle is to obtain, by alloying and heat treat- 
ment, a matrix containing finely divided particles of 
a second phase. These fine precipitates, usually dif- 
fering in lattice parameter from the matrix, set up 
coherency strains in the lattice which affect the domain 
boundary movement. Alloys of copper-nickel-iron, 
copper-nickel-cobalt and aluminium-nickel- cobalt 
are of this type. An important advance in this field 
is to make the particle size of the alloy so small, i.e. 
less than a hundred nanometres diameter, that each 
grain contains only a single domain. Then magneti- 
zation can occur only by the rotation of the direc- 
tion of magnetization en bloc. Alnico alloys con- 
taining 6-12% Al, 14-25% Ni, 0-35% Co, 0-8% 
Ti, 0-6% Cu in 40-70% Fe depend on this fea- 
ture and are the most commercially important perma- 
nent magnet materials. They are precipitation-hardened 
alloys and are heat-treated to produce rod-like pre- 
cipitates (30 nm x 100 nm) lying along (100) in the 
bcc matrix. During magnetic annealing the rods form 
along the (100) axis nearest to the direction of the 
field, when the remanence and coercivity are markedly 
increased, Sm 2 (Co, Fe, Cu, Zr) i7 alloys also rely on 
the pinning of magnetic domains by fine precipitates. 
Clear correlation exists between mechanical hardness 
and intrinsic coercivity. SmCos magnets depend on 
the very high magnetocrystalline anisotropy of this 
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compound and the individual grains are single-domain 
particles. The big advantage of these magnets over the 
Alnico alloys is their much higher coercivities. 

The Heusler alloys, copper- manganese -aluminium, 
are of particular interest because they are made up 
from non-ferromagnetic metals and yet exhibit ferro- 
magnetic properties. The magnetism in this group of 
alloys is associated with the compound Cu 2 MnAl, evi- 
dently because of the presence of manganese atoms. 
The compound has the Fe 3 Al-type superlattice when 
quenched from 800° C, and in this state is ferromag- 
netic, but when the alloy is slowly cooled it has 
a p-brass structure and is non-magnetic, presumably 
because the correct exchange forces arise from the lat- 
tice rearrangement on ordering. A similar behaviour is 
found in both the copper- manganese -gallium and the 
copper-manganese-indium systems. 

The order- disorder phenomenon is also of magnetic 
importance in many other systems. As discussed pre- 
viously, when ordering is accompanied by a structural 
change, i.e. cubic to tetragonal, coherency strains are 
set up which often lead to magnetic hardness. In FePt, 
for example, extremely high coercive forces are pro- 
duced by rapid cooling. However, because the change 
in mechanical properties accompanying the transfor- 
mation is found to be small, it has been suggested that 
the hard magnetic properties in this alloy are due to the 
small particle-size effect, which arises from the finely 
laminated state of the structure. 

6.8.5 Anti-ferromagnetism and 
ferrimagnetism 

Apart from the more usual dia-, para- and 
ferromagnetic materials, there are certain substances 
which are termed anti-ferromagnetic; in these, the 
net moments of neighbouring atoms are aligned in 
opposite directions, i.e. anti-parallel. Many oxides 
and chlorides of the transition metals are examples 
including both chromium and o'-manganese, and 
also manganese -copper alloys. Some of the relevant 


features of anti-ferromagnetism are similar in many 
respects to ferromagnetism, and are summarized as 
follows: 

1. In general, the magnetization directions are aligned 
parallel or anti-parallel to crystallographic axes, e.g. 
in Mnl and CoO the moment of the Mn 2+ and Co 2+ 
ions are aligned along a cube edge of the unit cell. 
The common directions are termed directions of 
anti-ferromagnetism. 

2. The degree of long-range anti-ferromagnetic order- 
ing progressively decreases with increasing temper- 
ature and becomes zero at a critical temperature, 
T n , known as the Neel temperature; this is the anti- 
ferromagnetic equivalent of the Curie temperature. 

3. An anti-ferromagnetic domain is a region in which 
there is only one common direction of anti- 
ferromagnetism; this is probably affected by lattice 
defects and strain. 

The most characteristic property of an anti- 
ferromagnetic material is that its susceptibility x shows 
a maximum as a function of temperature, as shown in 
Figure 6.34a. As the temperature is raised from 0 K the 
interaction which leads to anti-parallel spin alignment 
becomes less effective until at T n the spins are free. 

Similar characteristic features are shown in the resis- 
tivity curves due to scattering as a result of spin dis- 
order. However, the application of neutron diffraction 
techniques provides a more direct method of study- 
ing anti-ferromagnetic structures, as well as giving 
the magnetic moments associated with the ions of the 
metal. There is a magnetic scattering of neutrons in 
the case of certain magnetic atoms, and owing to the 
different scattering amplitude of the parallel and anti- 
parallel atoms, the possibility arises of the existence 
of superlattice lines in the anti-ferromagnetic state. 
In manganese oxide MnO, for example, the param- 
eter of the magnetic unit cell is 0.885 nm, whereas the 
chemical unit cell (NaCl structure) is half this value, 
0.443 nm. This atomic arrangement is analogous to the 
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Figure 6.34 (a) Variation of magnetic susceptibility with temperature for an anti-ferromagnetic material, (b) neutron 
diffraction pattern from the anti-ferromagnetic powder MnO above and below the critical temperature for ordering (after Shull 
and Smart, 1949). 



The physical properties of materials 193 


structure of an ordered alloy and the existence of mag- 
netic superlattice lines below the Neel point (122 K) 
has been observed, as shown in Figure 6.34b. 

Some magnetic materials have properties which 
are intermediate between those of anti-ferromagnetic 
and ferromagnetic. This arises if the moments in one 
direction are unequal in magnitude to those in the 
other, as, for example, in magnetite, Fe 3 C> 4 , where the 
ferrous and ferric ions of the FeO.Fe 2 C >3 compound 
occupy their own particular sites. Neel has called this 
state ferrimagnetism and the corresponding materials 
are termed ferrites. Such materials are of importance 
in the field of electrical engineering because they are 
ferromagnetic without being appreciably conducting; 
eddy current troubles in transformers are, therefore, 
not so great. Strontium ferrite is extensively used in 
applications such as electric motors, because of these 
properties and low material costs. 

6.9 Dielectric materials 

6.9.1 Polarization 

Dielectric materials, usually those which are covalent 
or ionic, possess a large energy gap between the 
valence band and the conduction band. These materials 
exhibit high electrical resistivity and have important 
applications as insulators, which prevent the transfer of 
electrical charge, and capacitors which store electrical 
charge. Dielectric materials also exhibit piezoelectric 
and ferroelectric properties. 

Application of an electric field to a material induces 
the formation of dipoles (i.e. atoms or groups of 
atoms with an unbalanced charge or moment) which 
become aligned with the direction of the applied field. 
The material is then polarized. This state can arise 
from several possible mechanisms -electronic, ionic 
or molecular, as shown in Figure 6.35a-c. With elec- 
tronic polarization, the electron clouds of an atom are 
displaced with respect to the positively-charged ion 
core setting up an electric dipole with moment /x e . For 
ionic solids in an electric field, the bonds between the 
ions are elastically deformed and the anion- cation dis- 
tances become closer or further apart, depending on the 
direction of the field. These induced dipoles produce 
polarization and may lead to dimensional changes. 


Molecular polarization occurs in molecular materials, 
some of which contain natural dipoles. Such materials 
are described as polar and for these the influence of an 
applied field will change the polarization by displacing 
the atoms and thus changing the dipole moment (i.e. 
atomic polarizability) or by causing the molecule as a 
whole to rotate to line up with the imposed field (i.e. 
orientation polarizability). When the field is removed 
these dipoles may remain aligned, leading to perma- 
nent polarization. Permanent dipoles exist in asymmet- 
rical molecules such as H 2 0, organic polymers with 
asymmetric structure and ceramic crystals without a 
centre of symmetry. 


6.9.2 Capacitors and insulators 

In a capacitor the charge is stored in a dielectric mate- 
rial which is easily polarized and has a high electrical 
resistivity ~10 n V A -1 m to prevent the charge flow- 
ing between conductor plates. The ability of the mate- 
rial to polarize is expressed by the permittivity s and 
the relative permittivity or dielectric constant k is the 
ratio of the permittivity of the material and the permit- 
tivity of a vacuum s 0 . While a high k is important for a 
capacitor, a high dielectric strength or breakdown volt- 
age is also essential. The dielectric constant k values 
for vacuum, water, polyethylene, Pyrex glass, alumina 
and barium titanate are 1, 78.3, 2.3, 4, 6.5 and 3000, 
respectively. 

Structure is an important feature of dielectric 
behaviour. Glassy polymers and crystalline materials 
have a lower k than their amorphous counterparts. 
Polymers with asymmetric chains have a high k 
because of the strength of the associated molecular 
dipole; thus polyvinyl chloride (PVC) and polystyrene 
(PS) have larger /c’s than polyethylene (PE). BaTiC >3 
has an extremely high k value because of its 
asymmetrical structure. Frequency response is also 
important in dielectric application, and depends on 
the mechanism of polarization. Materials which rely 
on electronic and ionic dipoles respond rapidly 
to frequencies of 10 13 — 10 16 Hz but molecular 
polarization solids, which require groups of atoms 
to rearrange, respond less rapidly. Frequency is also 
important in governing dielectric loss due to heat and 
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Figure 6.35 Application of field to produce polarization by (a) electronic, (b) ionic and (c) molecular mechanisms. 



194 Modern Physical Metallurgy and Materials Engineering 


usually increases when one of the contributions to 
polarization is prevented. This behaviour is common in 
microwave heating of polymer adhesives; preferential 
heating in the adhesive due to dielectric losses starts 
the thermosetting reaction. For moderate increases, 
raising the voltage and temperature increases the 
polarizability and leads to a higher dielectric constant. 
Nowadays, capacitor dielectrics combine materials 
with different temperature dependence to yield a final 
product with a small linear temperature variation. 
These materials are usually titanates of Ba, Ca, Mg, 
Sr and rare-earth metals. 

For an insulator, the material must possess a high 
electrical resistivity, a high dielectric strength to pre- 
vent breakdown of the insulator at high voltages, a 
low dielectric loss to prevent heating and small dielec- 
tric constant to hinder polarization and hence charge 
storage. Materials increasingly used are alumina, alu- 
minium nitride, glass-ceramics, steatite porcelain and 
glasses. 


6.9.3 Piezoelectric materials 

When stress is applied to certain materials an elec- 
tric polarization is produced proportional to the stress 
applied. This is the well-known piezoelectric effect. 
Conversely, dilatation occurs on application of an 
electric field. Common materials displaying this prop- 
erty are quartz, BaTi0 3 , Pb(Ti, Zr)0 3 or PZT and 
Na or LiNb0 3 . For quartz, the piezoelectric con- 
stant d relating strain s to field strength F(e = 
d x F) is 2.3 x 10 -12 m V -1 , whereas for PZT it 
is 250 x 10 -12 m V -1 . The piezoelectric effect is 
used in transducers which convert sound waves to 
electric fields, or vice versa. Applications range 
from microphones, where a few millivolts are gen- 
erated, to military devices creating several kilo- 
volts and from small sub-nanometre displacements in 
piezoelectrically-deformed mirrors to large deforma- 
tions in power transducers. 

6.9.4 Pyroelectric and ferroelectric materials 

Some materials, associated with low crystal symmetry, 
are observed to acquire an electric charge when 
heated; this is known as pyroelectricity. Because 
of the low symmetry, the centre of gravity of the 
positive and negative charges in the unit cell are 
separated producing a permanent dipole moment. 
Moreover, alignment of individual dipoles leads to 
an overall dipole moment which is non-zero for the 
crystal. Pyroelectric materials are used as detectors 
of electromagnetic radiation in a wide band from 
ultraviolet to microwave, in radiometers and in 
thermometers sensitive to changes of temperature as 
small as 6 x 10 -6 C. Pyroelectric TV camera tubes 
have also been developed for long- wavelength infrared 
imaging and are useful in providing visibility through 
smoke. Typical materials are strontium barium niobate 


and PZT with Pb 2 FeNb0 6 additions to broaden the 
temperature range of operation. 

Ferroelectric materials are those which retain a 
net polarization when the field is removed and is 
explained in terms of the residual alignment of per- 
manent dipoles. Not all materials that have perma- 
nent dipoles exhibit ferroelectric behaviour because 
these dipoles become randomly- arranged as the field 
is removed so that no net polarization remains. Ferro- 
electric s are related to the pyroelectrics; for the former 
materials the direction of spontaneous polarization can 
be reversed by an electric field (Figure 6.36) whereas 
for the latter this is not possible. This effect can be 
demonstrated by a polarization versus field hysteresis 
loop similar in form and explanation to the B-H mag- 
netic hysteresis loop (see Figure 6.31). With increasing 
positive field all the dipoles align to produce a satura- 
tion polarization. As the field is removed a remanent 
polarization P r remains due to a coupled interaction 
between dipoles. The material is permanently polarized 
and a coercive field E c has to be applied to randomize 
the dipoles and remove the polarization. 

Like ferromagnetism, ferroelectricity depends on 
temperature and disappears above an equivalent Curie 
temperature. For BaTi0 3 , ferroelectricity is lost at 
120°C when the material changes crystal structure. By 
analogy with magnetism there is also a ferroelectric 
analogue of anti-ferromagnetism and ferrimagnetism. 
NaNb0 3 , for example, has a T c , of 640°C and anti- 
parallel electric dipoles of unequal moments charac- 
teristic of a ferrielectric material. 



Figure 6.36 Hysteresis loop for ferroelectric materials, 
showing the influence of electric field E on polarization P. 
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6.10 Optical properties 

6.10.1 Reflection, absorption and 
transmission effects 

The optical properties of a material are related to the 
interaction of the material with electromagnetic radi- 
ation, particularly visible light. The electromagnetic 
spectrum is shown in Figure 5.1 from which it can be 
seen that the wavelength A varies from 10 4 m for radio 
waves down to 10“ 14 m for y-rays and the correspond- 
ing photon energies vary from 10“ 10 eV to 10 8 eV. 

Photons incident on a material may be reflected, 
absorbed or transmitted. Whether a photon is absorbed 
or transmitted by a material depends on the energy gap 
between the valency and conduction bands and the 
energy of the photon. The band structure for metals 
has no gap and so photons of almost any energy are 
absorbed by exciting electrons from the valency band 
into a higher energy level in the conduction band. 
Metals are thus opaque to all electromagnetic radiation 
from radio waves, through the infrared, the visible to 
the ultraviolet, but are transparent to high-energy X- 
rays and y-rays. Much of the absorbed radiation is 
reemitted as radiation of the same wavelength (i.e. 
reflected). Metals are both opaque and reflective and 
it is the wavelength distribution of the reflected light, 
which we see, that determines the colour of the metal. 
Thus copper and gold reflect only a certain range of 
wavelengths and absorb the remaining photons, i.e. 
copper reflects the longer-wavelength red light and 
absorbs the shorter-wavelength blue. Aluminium and 
silver are highly reflective over the complete range of 
the visible spectrum and appear silvery. 

Because of the gaps in their band structure non- 
metals may be transparent. Thus if the photons have 
insufficient energy to excite electrons in the mate- 
rial to a higher energy level, they may be transmitted 
rather than absorbed and the material is transparent. 
In high-purity ceramics and polymers, the energy gap 
is large and these materials are transparent to visible 
light. In semiconductors, electrons can be excited into 
acceptor levels or out of donor levels and phonons hav- 
ing sufficient energy to produce these transitions will 
be absorbed. Semiconductors are therefore opaque to 
short wavelengths and transparent to long. 1 The band 
structure is influenced by crystallinity and hence mate- 
rials such as glasses and polymers may be transparent 
in the amorphous state but opaque when crystalline. 

High-purity non-metallics such as glasses, diamond 
or sapphire (A1 2 0 3 ) are colourless but are changed by 
impurities. For example, small additions of Cr 3+ ions 
(Cr 2 0 3 ) to A1 2 0 3 produces a ruby colour by introduc- 
ing impurity levels within the band-gap of sapphire 
which give rise to absorption of specific wavelengths in 

1 Figure 5.37b shows a dislocation source in the interior of a 

silicon crystal observed using infrared light. 


the visible spectrum. Colouring of glasses and ceram- 
ics is produced by addition of transition metal impu- 
rities which have unfilled d- shells. The photons easily 
interact with these ions and are absorbed; Cr 3+ gives 
green, Mn 2+ yellow and Co 2+ blue-violet colouring. 

In photochromic sunglasses the energy of light 
quanta is used to produce changes in the ionic structure 
of the glass. The glass contains silver (Ag + ) ions as a 
dopant which are trapped in the disordered glass net- 
work of silicon and oxygen ions: these are excited by 
high-energy quanta (photons) and change to metallic 
silver, causing the glass to darken (i.e. light energy is 
absorbed). With a reduction in light intensity, the silver 
atoms re-ionize. These processes take a small period of 
time relying on absorption and non-absorption of light. 

6.10.2 Optical fibres 

Modem communication systems make use of the abil- 
ity of optical fibres to transmit light signals over large 
distances. Optical guidance by a fibre is produced (see 
Figure 6.37) if a core fibre of refractive index n\ is 
surrounded by a cladding of slightly lower index n 2 
such that total internal reflection occurs confining the 
rays to the core; typically the core is about 100 pm 
and ni — n 2 ~ 10~ 2 . With such a simple optical fibre, 
interference occurs between different modes leading 
to a smearing of the signals. Later designs use a 
core in which the refractive index is graded, parabol- 
ically, between the core axis and the interface with 
the cladding. This design enables modulated signals to 
maintain their coherency. In vitreous silica, the refrac- 
tive index can be modified by additions of dopants such 
as P 2 0 5 , Ge0 2 which raise it and B 2 0 5 and F which 
lower it. Cables are sheathed to give strength and envi- 
ronmental protection; PE and PVC are commonly used 
for limited fire-hazard conditions. 

6.10.3 Lasers 

A laser (Light Amplification by Stimulated Emission 
of Radiation) is a powerful source of coherent light 
(i.e. monochromatic and all in phase). The original 
laser material, still used, is a single crystal rod of 
mby, i.e. A1 2 0 3 containing dopant Cr 3+ ions in solid 
solution. Nowadays, lasers can be solid, liquid or 
gaseous materials and ceramics, glasses and semicon- 
ductors. In all cases, electrons of the laser material 
are excited into a higher energy state by some suitable 
stimulus (see Figure 6.38). In a device this is produced 
by the photons from a flash tube, to give an intense 



Figure 6.37 Optical guidance in a multimode fibre. 
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source of light surrounding the rod to concentrate the 
energy into the laser rod. Alternatively an electrical 
discharge in a gas is used. The ends of the laser rod 
are polished flat and parallel and then silvered such 
that one end is totally-reflecting and the other partially- 
transmitting. 

In the ruby laser, a xenon flash lamp excites the 
electrons of the Cr 3+ ions into higher energy states. 
Some of these excited electrons decay back to their 
ground state directly and are not involved in the 
laser process. Other electrons decay into a metastable 
intermediate state before further emission returns them 
to the ground state. Some of the electrons in the 
metastable state will be spontaneously emitted after a 
short (~ms) rest period. Some of the resultant photons 
are retained in the rod because of the reflectivity of 
the silvered ends and stimulate the release of other 
electrons from the metastable state. Thus one photon 
releases another such that an avalanche of emissions 
is triggered all in phase with the triggering photon. 
The intensity of the light increases as more emissions 
are stimulated until a very high intense, coherent, 
collimated ‘burst’ of light is transmitted through the 
partially-silvered end lasting a few nanoseconds and 
with considerable intensity. 

6.10.4 Ceramic ‘windows’ 

Many ceramics, usually oxides, have been prepared in 
optically-transparent or translucent forms (translucent 
means that incident light is partly-reflected and partly- 
transmitted). Examples include aluminium oxide, 
magnesium oxide, their double oxide or spinel, and 
chalcogenides (zinc sulphide, zinc selenide). Very 
pure raw materials of fine particle size are carefully 
processed to eliminate voids and control grain sizes. 


Thus, translucent alumina is used for the arc tube of 
high-pressure sodium lamps; a grain size of 25 pm 
gives the best balance of translucency and mechanical 
strength. 

Ceramics are also available to transmit electromag- 
netic radiation with wavelengths which he below or 
above the visible range of 400-700 nm (e.g. infrared, 
microwave, radar, etc.). Typical candidate materials for 
development include vitreous silica, cordierite glass- 
ceramics and alumina. 

6.10.5 Electro-optic ceramics 

Certain special ceramics combine electrical and optical 
properties in a unique manner. Lead lanthanum zirco- 
nium titanate, known as PLZT, is a highly-transparent 
ceramic which becomes optically birefringent when 
electrically charged. This phenomenon is utilized as 
a switching mechanism in arc welding goggles, giv- 
ing protection against flash blindness. The PLZT plate 
is located between two ‘crossed’ sheets of polarizing 
material. A small impressed d.c. voltage on the PLZT 
plate causes it to split the incident light into two rays 
vibrating in different planes. One of these rays can 
pass through the inner polar sheet and enter the eye. A 
sudden flash of light will activate photodiodes in the 
goggles, reduce the impressed voltage and cause rapid 
darkening of the goggles. 
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Chapter 7 

Mechanical behaviour of materials 


7.1 Mechanical testing procedures 

7.1.1 Introduction 

Real crystals, however carefully prepared, contain 
lattice imperfections which profoundly affect those 
properties sensitive to structure. Careful examination 
of the mechanical behaviour of materials can give 
information on the nature of these atomic defects. 
In some branches of industry the common mechani- 
cal tests, such as tensile, hardness, impact, creep and 
fatigue tests, may be used, not to study the ‘defect 
state’ but to check the quality of the product pro- 
duced against a standard specification. Whatever its 
purpose, the mechanical test is of importance in the 
development of both materials science and engineer- 
ing properties. It is inevitable that a large number of 
different machines for performing the tests are in gen- 
eral use. This is because it is often necessary to know 
the effect of temperature and strain rate at different 
levels of stress depending on the material being tested. 
Consequently, no attempt is made here to describe the 
details of the various testing machines. The elements 
of the various tests are outlined below. 

7.1.2 The tensile test 

In a tensile test the ends of a test piece are fixed into 
grips, one of which is attached to the load-measuring 
device on the tensile machine and the other to the 
straining device. The strain is usually applied by means 
of a motor-driven crosshead and the elongation of 
the specimen is indicated by its relative movement. 
The load necessary to cause this elongation may be 
obtained from the elastic deflection of either a beam 
or proving ring, which may be measured by using 
hydraulic, optical or electromechanical methods. The 
last method (where there is a change in the resistance 
of strain gauges attached to the beam) is, of course, 
easily adapted into a system for autographically record- 
ing the load -elongation curve. 


The load -elongation curves for both poly crystalline 
mild steel and copper are shown in Figures 7.1a and 
7.1b. The corresponding stress (load per unit area, 
P/A ) versus strain (change in length per unit length, 
d ///) curves may be obtained knowing the dimensions 
of the test piece. At low stresses the deformation is 
elastic, reversible and obeys Hooke’s law with stress 
linearly proportional to strain. The proportionality con- 
stant connecting stress and strain is known as the 
elastic modulus and may be either (a) the elastic or 
Young’s modulus, E, (b) the rigidity or shear modulus 
/x, or (c) the bulk modulus K , depending on whether 
the strain is tensile, shear or hydrostatic compressive, 
respectively. Young’s modulus, bulk modulus, shear 
modulus and Poisson’s ratio v, the ratio of lateral con- 
tractions to longitudinal extension in uniaxial tension, 
are related according to 

E E 9K/jl 

2(1 — 2v) ^ 2(1 + v) 3 K + ii 

(7.1) 

In general, the elastic limit is an ill-defined stress, 
but for impure iron and low carbon steels the onset 
of plastic deformation is denoted by a sudden drop in 
load indicating both an upper and lower yield point. 1 
This yielding behaviour is characteristic of many met- 
als, particularly those with bcc structure containing 
small amounts of solute element (see Section 7.4.6). 
For materials not showing a sharp yield point, a con- 
ventional definition of the beginning of plastic flow is 
the 0.1% proof stress, in which a line is drawn parallel 

^oad relaxations are obtained only on ‘hard’ beam 
Polanyi-type machines where the beam deflection is small 
over the working load range. With ‘soft’ machines, those in 
which the load-measuring device is a soft spring, rapid load 
variations are not recorded because the extensions required 
are too large, while in dead-loading machines no load 
relaxations are possible. In these latter machines sudden 
yielding will show as merely an extension under constant 
load. 
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(b) 

Figure 7.1 Stress-elongation curves for (a) impure iron, (b) copper, (c) ductile -brittle transition in mild steel ( after 
Churchman, Mogford and Cottrell, 1957). 


to the elastic portion of the stress -strain curve from 
the point of 0.1% strain. 

For control purposes the tensile test gives valuable 
information on the tensile strength (TS = maximum 
load/original area) and ductility (percentage reduction 
in area or percentage elongation) of the material. When 
it is used as a research technique, however, the exact 
shape and fine details of the curve, in addition to the 
way in which the yield stress and fracture stress vary 
with temperature, alloying additions and grain size, are 
probably of greater significance. 

The increase in stress from the initial yield up to the 
TS indicates that the specimen hardens during defor- 
mation (i.e. work-hardens). On straining beyond the 
TS the metal still continues to work-harden, but at 
a rate too small to compensate for the reduction in 
cross-sectional area of the test piece. The deforma- 
tion then becomes unstable, such that as a localized 
region of the gauge length strains more than the rest, 
it cannot harden sufficiently to raise the stress for fur- 
ther deformation in this region above that to cause 
further strain elsewhere. A neck then forms in the 
gauge length, and further deformation is confined to 
this region until fracture. Under these conditions, the 
reduction in area (A 0 — Ai)/A 0 where A 0 and A\ are 
the initial and final areas of the neck gives a mea- 
sure of the localized strain, and is a better indication 


than the strain to fracture measured along the gauge 
length. 

True stress -true strain curves are often plotted to 
show the work hardening and strain behaviour at large 
strains. The true stress o is the load P divided by the 
area A of the specimen at that particular stage of strain 
and the total true strain in deforming from initial length 
/o to length l\ is e — //j (d/ // ) = ln(/i// 0 ). The true 
stress -strain curves often fit the Ludwig relation a = 
ks n where n is a work-hardening coefficient ~0.1 -0.5 
and k the strength coefficient. Plastic instability, or 
necking, occurs when an increase in strain produces 
no increase in load supported by the specimen, i.e. 
d P = 0, and hence since P = aA, then 

d P = Ada + adA = 0 

defines the instability condition. During deformation, 
the specimen volume is essentially constant (i.e. dV = 
0) and from 

dF = d(/A) = Adi + ZdA = 0 
we obtain 


Thus, necking occurs at a strain at which the slope 
of the true stress -true strain curve equals the true 
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Figure 7.2 Considered construction. 

stress at that strain, i.e. da/d s = a. Alternatively, since 
ks n = a = da/d£ = nks n ~ l then s = n and necking 
occurs when the true strain equals the strain-hardening 
exponent. The instability condition may also be 
expressed in terms of the conventional (nominal strain) 

da da d s n da /d///o\ da / 

d s d s n d s d s n \ dl/l J d s n lo 

da 

= — (l+e„) = a (7.3) 

d s n 

which allows the instability point to be located using 
Considered construction (see Figure 7.2), by plotting 
the true stress against nominal strain and drawing the 
tangent to the curve from e n = — 1 on the strain axis. 
The point of contact is the instability stress and the 
tensile strength is a/(l + e n ). 

Tensile specimens can also give information on the 
type of fracture exhibited. Usually in poly crystalline 
metals transgranular fractures occur (i.e. the fracture 
surface cuts through the grains) and the ‘cup and cone’ 
type of fracture is extremely common in really duc- 
tile metals such as copper. In this, the fracture starts 
at the centre of the necked portion of the test piece 
and at first grows roughly perpendicular to the tensile 
axis, so forming the ‘cup’, but then, as it nears the 
outer surface, it turns into a ‘cone’ by fracturing along 
a surface at about 45° to the tensile axis. In detail 
the ‘cup’ itself consists of many irregular surfaces at 
about 45° to the tensile axis, which gives the fracture a 
fibrous appearance. Cleavage is also a fairly common 
type of transgranular fracture, particularly in materi- 
als of bcc structure when tested at low temperatures. 
The fracture surface follows certain crystal planes (e.g. 
{100} planes), as is shown by the grains revealing 
large bright facets, but the surface also appears gran- 
ular with ‘river lines’ running across the facets where 
cleavage planes have been tom apart. Intercrystalline 
fractures sometimes occur, often without appreciable 
deformation. This type of fracture is usually caused 
by a brittle second phase precipitating out around the 
grain boundaries, as shown by copper containing bis- 
muth or antimony. 

7.1.3 Indentation hardness testing 

The hardness of a metal, defined as the resistance to 
penetration, gives a conveniently rapid indication of 


its deformation behaviour. The hardness tester forces 
a small sphere, pyramid or cone into the surface of 
the metals by means of a known applied load, and the 
hardness number (Brinell or Vickers diamond pyramid) 
is then obtained from the diameter of the impression. 
The hardness may be related to the yield or tensile 
strength of the metal, since during the indentation, the 
material around the impression is plastically deformed 
to a certain percentage strain. The Vickers hardness 
number (VPN) is defined as the load divided by the 
pyramidal area of the indentation, in kgf/mm 2 , and is 
about three times the yield stress for materials which 
do not work harden appreciably. The Brinell hardness 
number (BHN) is defined as the stress P/A , in kgf/mm 2 
where P is the load and A the surface area of the 
spherical cap forming the indentation. Thus 

BHN = p/ (^D 2 ) {1 - [1 - (d/D) 2 ] 1 ' 2 } 

where d and D are the indentation and indentor diam- 
eters respectively. For consistent results the ratio d/D 
should be maintained constant and small. Under these 
conditions soft materials have similar values of BHN 
and VPN. Hardness testing is of importance in both 
control work and research, especially where informa- 
tion on brittle materials at elevated temperatures is 
required. 

7.1.4 Impact testing 

A material may have a high tensile strength and yet 
be unsuitable for shock loading conditions. To deter- 
mine this the impact resistance is usually measured by 
means of the notched or un-notched Izod or Charpy 
impact test. In this test a load swings from a given 
height to strike the specimen, and the energy dissi- 
pated in the fracture is measured. The test is partic- 
ularly useful in showing the decrease in ductility and 
impact strength of materials of bcc structure at mod- 
erately low temperatures. For example, carbon steels 
have a relatively high ductile -brittle transition tem- 
perature (Figure 7.1c) and, consequently, they may be 
used with safety at sub-zero temperatures only if the 
transition temperature is lowered by suitable alloy- 
ing additions or by refining the grain size. Nowadays, 
increasing importance is given to defining a fracture 
toughness parameter K c for an alloy, since many alloys 
contain small cracks which, when subjected to some 
critical stress, propagate; K c defines the critical com- 
bination of stress and crack length. Brittle fracture is 
discussed more fully in Chapter 8. 

7.1.5 Creep testing 

Creep is defined as plastic flow under constant stress, 
and although the majority of tests are carried out under 
constant load conditions, equipment is available for 
reducing the loading during the test to compensate 
for the small reduction in cross-section of the spec- 
imen. At relatively high temperatures creep appears to 
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Figure 7.3 Typical creep curves. 

occur at all stress levels, but the creep rate increase 
with increasing stress at a given temperature. For the 
accurate assessment of creep properties, it is clear that 
special attention must be given to the maintenance of 
the specimen at a constant temperature, and to the mea- 
surement of the small dimensional changes involved. 
This latter precaution is necessary, since in many mate- 
rials a rise in temperature by a few tens of degrees is 
sufficient to double the creep rate. Figure 7.3, curve a, 
shows the characteristics of a typical creep curve and 
following the instantaneous strain caused by the sud- 
den application of the load, the creep process may be 
divided into three stages, usually termed primary or 
transient creep, second or steady-state creep and ter- 
tiary or accelerating creep. The characteristics of the 
creep curve often vary, however, and the tertiary stage 
of creep may be advanced or retarded if the tempera- 
ture and stress at which the test is carried out is high 
or low respectively (see Figure 7.3, curves b and c ). 
Creep is discussed more fully in Section 7.9. 

7.1.6 Fatigue testing 

The fatigue phenomenon is concerned with the prema- 
ture fracture of metals under repeatedly applied low 
stresses, and is of importance in many branches of 
engineering (e.g. aircraft structures). Several differ- 
ent types of testing machines have been constructed 
in which the stress is applied by bending, torsion, 




tension or compression, but all involve the same prin- 
ciple of subjecting the material to constant cycles of 
stress. To express the characteristics of the stress sys- 
tem, three properties are usually quoted: these include 
(1) the maximum range of stress, (2) the mean stress, 
and (3) the time period for the stress cycle. Four dif- 
ferent arrangements of the stress cycle are shown in 
Figure 7.4, but the reverse and the repeated cycle tests 
(e.g. ‘push-pull’) are the most common, since they are 
the easiest to achieve in the laboratory. 

The standard method of studying fatigue is to pre- 
pare a large number of specimens free from flaws, 
and to subject them to tests using a different range 
of stress, S , on each group of specimens. The num- 
ber of stress cycles, N , endured by each specimen at 
a given stress level is recorded and plotted, as shown 
in Figure 7.5. This S-N diagram indicates that some 
metals can withstand indefinitely the application of a 
large number of stress reversals, provided the applied 
stress is below a limiting stress known as the endurance 
limit. For certain ferrous materials when they are used 
in the absence of corrosive conditions the assumption 
of a safe working range of stress seems justified, but 
for non-ferrous materials and for steels when they are 
used in corrosive conditions a definite endurance limit 
cannot be defined. Fatigue is discussed in more detail 
in Section 7.11. 

7.1.7 Testing of ceramics 

Direct tensile testing of ceramics is not generally 
favoured, mainly because of the extreme sensitivity of 
ceramics to surface flaws. First, it is difficult to apply 
a truly uniaxial tensile stress: mounting the specimen 
in the machine grips can seriously damage the surface 
and any bending of the specimen during the test will 
cause premature failure. Second, suitable waisted spec- 
imens with the necessary fine and flawless finish are 
expensive to produce. It is therefore common practice 
to use bend tests for engineering ceramics and glasses. 
(They have long been used for other non-ductile mate- 
rials such as concretes and grey cast iron.) In the three- 
and four-point bend methods portrayed in Figure 7.6, a 
beam specimen is placed between rollers and carefully 
loaded at a constant strain rate. The flexural strength 
at failure, calculated from the standard formulae, is 



Figure 7.4 Alternative forms of stress cycling: (a) reversed; (b) alternating ( mean stress / zero), (c) fluctuating and 
(d) repeated. 
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Figure 7.5 S-N curve for carburized and decarburized iron. 
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Figure 7.6 Bend test configurations. MoR = modulus of 
rupture, F = applied force, L = outer span, Li = inner 
span, b = breadth of specimen, d = depth of specimen. 


known as the modulus of rupture (MoR) and expresses 
the maximum tensile stress which develops on the con- 
vex face of the loaded beam. Strong ceramics, such as 
silicon carbide and hot-pressed silicon nitride, have 
very high MoR values. The four-point loading method 
is often preferred because it subjects a greater volume 
and area of the beam to stress and is therefore more 
searching. MoR values from four-point tests are often 
substantially lower than those from three-point tests 
on the same material. Similarly, strength values tend 
to decrease as the specimen size is increased. To pro- 
vide worthwhile data for quality control and design 
activities, close attention must be paid to strain rate 
and environment, and to the size, edge finish and sur- 
face texture of the specimen. With oxide ceramics and 
silica glasses, a high strain rate will give an appre- 
ciably higher flexural strength value than a low strain 
rate, which leads to slow crack growth and delayed 
fracture (Section 10.7). 

The bend test has also been adapted for use at high 
temperatures. In one industrial procedure, specimens 
of magnesia (basic) refractory are fed individually 
from a magazine into a three-point loading zone at the 


centre of an electric furnace heated by SiC elements. 
A similar type of hot-bend test has been used for 
the routine testing of graphite electrode samples and 
gives a useful indication of their ability to withstand 
accidental lateral impact during service in steel melting 
furnaces. 

Proof-testing is a long-established method of test- 
ing certain engineering components and structures. 
In a typical proof test, each component is held at a 
certain proof stress for a fixed period of time; load- 
ing and unloading conditions are standardized. In the 
case of ceramics, it may involve bend-testing, inter- 
nal pressurization (for tubes) or rotation at high speed 
(‘overspeeding’ of grinding wheels). Components that 
withstand the proof test are, in the simplest analysis, 
judged to be sound and suitable for long-term service 
at the lower design stress. The underlying philosophy 
has been often questioned, not least because there is a 
risk that the proof test itself may cause incipient crack- 
ing. Nevertheless, proof-testing now has an important 
role in the statistical control of strength in ceramics. 


7.2 Elastic deformation 

7.2.1 Elastic deformation of metals 

It is well known that metals deform both elastically 
and plastically. Elastic deformation takes place at low 
stresses and has three main characteristics, namely 
(1) it is reversible, (2) stress and strain are linearly 
proportional to each other according to Hooke’s Law 
and (3) it is usually small (i.e. <1% elastic strain). 

The stress at a point in a body is usually defined 
by considering an infinitesimal cube surrounding that 
point and the forces applied to the faces of the cube by 
the surrounding material. These forces may be resolved 
into components parallel to the cube edges and when 
divided by the area of a face give the nine stress 
components shown in Figure 7.7. A given component 
o ij is the force acting in the j -direction per unit 
area of face normal to the /-direction. Clearly, when 
/ = j we have normal stress components (e.g. o xx ) 
which may be either tensile (conventionally positive) 
or compressive (negative), and when i ^ j (e.g. o xy ) 
the stress components are shear. These shear stresses 
exert couples on the cube and to prevent rotation of the 
cube the couples on opposite faces must balance and 
hence <r i; = op . 1 Thus, stress has only six independent 
components. 

When a body is strained, small elements in that 
body are displaced. If the initial position of an element 

lr The nine components of stress 077 form a second-rank 
tensor usually written 

®xx ®xy ®xz 

®yx ®yy ®yz 

®zx ®z.y ®z.z 

and is known as the stress tensor. 
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Figure 7.7 Normal and shear stress components. 

is defined by its coordinates (x, y, z) and its final 
position by (x + u, y + v, z + w) then the displacement 
is ( u , v, w). If this displacement is constant for all 
elements in the body, no strain is involved, only a 
rigid translation. For a body to be under a condition 
of strain the displacements must vary from element 
to element. A uniform strain is produced when the 
displacements are linearly proportional to distance. In 
one dimension then u = ex where e = du/dx is the 
coefficient of proportionality or nominal tensile strain. 
For a three-dimensional uniform strain, each of the 
three components u, v, w is made a linear function in 
terms of the initial elemental coordinates, i.e. 

u = e xx x + + e xz z 

V Cy X X ~ l - Cyyy “ C y Z Z 

w = e ZK x + e zy y + 

The strains e xx = du/dx , e yy = du/d y, e zz = dw/dz are 
the tensile strains along the x, y and z axes, respec- 
tively. The strains e xy , e yz , etc., produce shear strains 
and in some cases a rigid body rotation. The rotation 
produces no strain and can be allowed for by rotat- 
ing the reference axes (see Figure 7.8). In general, 
therefore, e t j = + (Oij with £ ZJ - the strain compo- 

nents and ooij the rotation components. If, however, 
the shear strain is defined as the angle of shear, this 
is twice the corresponding shear strain component, i.e. 
y = 2s ij . The strain tensor, like the stress tensor, has 
nine components which are usually written as: 

i i 

£ £ £ £ xx 2 Yxy 2 Yxz 

°xx °xy °xz 1 | 

£ yx Syy £ y Z Or 2 y y% £ yy 2 ^ 

&zx £ zy £ zz i i 

2 Yzx 2 Yzy £ zz 

where s xx etc. are tensile strains and y xy , etc. are 
shear strains. All the simple types of strain can be 
produced from the strain tensor by setting some of 
the components equal to zero. For example, a pure 
dilatation (i.e. change of volume without change of 
shape) is obtained when £ xx = £ yy = s zz and all other 
components are zero. Another example is a uniaxial 
tensile test when the tensile strain along the x-axis 



Figure 7.8 Deformation of a square OABC to a 
parallelogram PQRS involving (i) a rigid body translation 
OP allowed for by redefining new axes X'Y' , (ii) a rigid body 
rotation allowed for by rotating the axes to X" Y" , and (iii) a 
change of shape involving both tensile and shear strains. 

is simply e = s xx . However, because of the strains 
introduced by lateral contraction, s yy = —ve and £ zz = 
— ve, where v is Poisson’s ratio; all other components 
of the strain tensor are zero. 

At small elastic deformations, the stress is linearly 
proportional to the strain. This is Hooke’s law and in 
its simplest form relates the uniaxial stress to the uni- 
axial strain by means of the modulus of elasticity. For 
a general situation, it is necessary to write Hooke’s law 
as a linear relationship between six stress components 
and the six strain components, i.e. 

®xx = C\\£xx T C\2 £yy “b C\2,£ zz + C 14 y yz + Cl5Yzx T ^16 Yxy 
°yy~ c 2\£xx + c 22^yy + %3 + c 2\Yyz + c 25 Yzx + c 26 Yxy 
®zz — C?>\£xx T C32&yy T C 33 £ zz ~b C^Yyz T C^sYzx T ^36 Yxy 
Ty Z = C 41 £ xx + C 42 £yy + C 4 3% + ?44 Yyz + c 45 Yzx + c 46 Yxy 
^zx~ c 5\£xx + c 52 £yy + <^ 53 % + CsA-Yyz + c 55 Yzx + c 56 Yxy 
Txy = ^61^xx T C62 £yy T C63^zz T C64K_yz ^6 5Yzx T C66Y X y 

The constants cn, C 12 , • • • , Q; are called the elastic 
stiffness constants. 1 

Taking account of the symmetry of the crystal, many 
of these elastic constants are equal or become zero. 
Thus in cubic crystals there are only three indepen- 
dent elastic constants c n, ci 2 and c 44 for the three 
independent modes of deformation. These include the 
application of (1) a hydrostatic stress p to produce a 
dilatation 0 given by 

p = -3(Cjj +2 c 12 )@ = -K 

where k is the bulk modulus, (2) a shear stress on a 
cube face in the direction of the cube axis defining 

the shear modulus /x = c 44 , and (3) a rotation about a 

i 

cubic axis defining a shear modulus pt\ = 2 (c n — c 12 ). 
The ratio /x//x \ is the elastic anisotropy factor and 
in elastically isotropic crystals it is unity with 2 c 44 = 

Alternatively, the strain may be related to the stress, e.g. 
s x = sn(T xx + s\ 2 <Jyy + s\^(7 zz + . . ., in which case the 
constants s\\,s 12 , • • • , Sij are called elastic compliances. 
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Table 7.1 Elastic constants of cubic crystals (GN/m 2 ) 


Metal 

cu 

C12 

C44 

2 C44 / (< cu - C12 ) 

Na 

006.0 

004.6 

005.9 

8.5 

K 

004.6 

003.7 

002.6 

5.8 

Fe 

237.0 

141.0 

116.0 

2.4 

W 

501.0 

198.0 

151.0 

1.0 

Mo 

460.0 

179.0 

109.0 

0.77 

A1 

108.0 

62.0 

28.0 

1.2 

Cu 

170.0 

121.0 

75.0 

3.3 

Ag 

120.0 

90.0 

43.0 

2.9 

Au 

186.0 

157.0 

42.0 

3.9 

Ni 

250.0 

160.0 

118.0 

2.6 

/kbrass 

129.1 

109.7 

82.4 

8.5 


c\\ — c\ 2 \ the constants are all interrelated with cn = 

4 2 

K + 3 /X, C\2 = K — 3 /X and C44 = /x. 

Table 7.1 shows that most metals are far from 
isotropic and, in fact, only tungsten is isotropic; the 
alkali metals and ^-compounds are mostly anisotropic. 
Generally, 2 c 44 > (cn — c 12 ) and hence, for most elas- 
tically anisotropic metals E is maximum in the (111) 
and minimum in the (100) directions. Molybde- 
num and niobium are unusual in having the reverse 
anisotropy when E is greatest along (100) directions. 
Most commercial materials are polycrystalline, and 
consequently they have approximately isotropic prop- 
erties. For such materials the modulus value is usually 
independent of the direction of measurement because 
the value observed is an average for all directions, in 
the various crystals of the specimen. However, if dur- 
ing manufacture a preferred orientation of the grains in 
the polycrystalline specimen occurs, the material will 
behave, to some extent, like a single crystal and some 
‘directionality’ will take place. 

7.2.2 Elastic deformation of ceramics 

At ambient temperatures the profile of the stress ver- 
sus strain curve for a conventional ceramic is similar 
to that of a non-ductile metal and can be described 
as linear-elastic, remaining straight until the point of 
fracture is approached. The strong interatomic bonding 
of engineering ceramics confers mechanical stiffness. 
Moduli of elasticity (elastic, shear) can be much higher 
than those of metallic materials. In the case of sin- 
gle ceramic crystals, these moduli are often highly 
anisotropic (e.g. alumina). However, in their polycrys- 
talline forms, ceramics are often isotropic as a result 
of the randomizing effect of processing (e.g. isostatic 
pressing); nevertheless, some processing routes pre- 
serve anisotropic tendencies (e.g. extrusion). (Glasses 
are isotropic, of course.) Moduli are greatly influ- 
enced by the presence of impurities, second phases 
and porosity; for instance, the elastic modulus of a 
ceramic is lowered as porosity is increased. As the 
temperature of testing is raised, elastic moduli usually 
show a decrease, but there are exceptions. 


7.3 Plastic deformation 

7.3.1 Slip and twinning 

The limit of the elastic range cannot be defined 
exactly but may be considered to be that value of 
the stress below which the amount of plasticity (irre- 
versible deformation) is negligible, and above which 
the amount of plastic deformation is far greater than 
the elastic deformation. If we consider the deforma- 
tion of a metal in a tensile test, one or other of two 
types of curve may be obtained. Figure 7.1a shows the 
stress -strain curve characteristic of iron, from which 
it can be seen that plastic deformation begins abruptly 
at A and continues initially with no increase in stress. 
The point A is known as the yield point and the stress 
at which it occurs is the yield stress. Figure 7.1b shows 
a stress -strain curve characteristic of copper, from 
which it will be noted that the transition to the plastic 
range is gradual. No abrupt yielding takes place and in 
this case the stress required to start macroscopic plastic 
flow is known as the flow stress. 

Once the yield or flow stress has been exceeded 
plastic or permanent deformation occurs, and this is 
found to take place by one of two simple processes, 
slip (or glide) and twinning. During slip, shown in 
Figure 7.9a, the top half of the crystal moves over 
the bottom half along certain crystallographic planes, 
known as slip planes, in such a way that the atoms 
move forward by a whole number of lattice vectors; 
as a result the continuity of the lattice is maintained. 
During twinning (Figure 7.9b) the atomic movements 
are not whole lattice vectors, and the lattice generated 
in the deformed region, although the same as the parent 
lattice, is oriented in a twin relationship to it. It will 
also be observed that in contrast to slip, the sheared 
region in twinning occurs over many atom planes, the 
atoms in each plane being moved forward by the same 
amount relative to those of the plane below them. 

7.3.2 Resolved shear stress 

All working processes such as rolling, extrusion, forg- 
ing etc. cause plastic deformation and, consequently, 
these operations will involve the processes of slip or 
twinning outlined above. The stress system applied 
during these working operations is often quite com- 
plex, but for plastic deformation to occur the presence 
of a shear stress is essential. The importance of shear 


r r 



Figure 7.9 Slip and twinning in a crystal. 
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stresses becomes clear when it is realized that these 
stresses arise in most processes and tests even when 
the applied stress itself is not a pure shear stress. This 
may be illustrated by examining a cylindrical crys- 
tal of area A in a conventional tensile test under a 
uniaxial load P. In such a test, slip occurs on the 
slip plane, shown shaded in Figure 7.10, the area of 
which is A/cos0, where 0 is the angle between the 
normal to the plane OH and the axis of tension. The 
applied force P is spread over this plane and may be 
resolved into a force normal to the plane along OH, 
P cos 0, and a force along OS, P sin 0. Here, OS is the 
line of greatest slope in the slip plane and the force 
P sin 0 is a shear force. It follows that the applied stress 
(force/area) is made up of two stresses, a normal stress 
(P/A) cos 2 0 tending to pull the atoms apart, and a 
shear stress (P/A)cos0 sin0 trying to slide the atoms 
over each other. 

In general, slip does not take place down the line 
of greatest slope unless this happens to coincide with 
the crystallographic slip of direction. It is necessary, 
therefore, to know the resolved shear stress on the slip 
plane and in the slip direction. Now, if OT is taken 
to represent the slip direction the resolved shear stress 
will be given by 

a = P cos 0 sin 0 cos x/A 

where x is the angle between OS and OT. Usually this 
formula is written more simply as 

a = P cos 0 cos X/A (7.4) 

where k is the angle between the slip direction OT and 
the axis of tension. It can be seen that the resolved 
shear stress has a maximum value when the slip plane 
is inclined at 45° to the tensile axis, and becomes 
smaller for angles either greater than or less than 45°. 
When the slip plane becomes more nearly perpendic- 
ular to the tensile axis (0 > 45°) it is easy to imagine 
that the applied stress has a greater tendency to pull 
the atoms apart than to slide them. When the slip 
plane becomes more nearly parallel to the tensile axis 
(0 < 45°) the shear stress is again small but in this 
case it is because the area of the slip plane, A/cos0, 
is correspondingly large. 



Figure 7.10 Relation between the slip plane, slip direction 
and the axis of tension for a cylindrical crystal. 


A consideration of the tensile test in this way shows 
that it is shear stresses which lead to plastic defor- 
mation, and for this reason the mechanical behaviour 
exhibited by a material will depend, to some extent, 
on the type of test applied. For example, a ductile 
material can be fractured without displaying its plastic 
properties if tested in a state of hydrostatic or triax- 
ial tension, since under these conditions the resolved 
shear stress on any plane is zero. Conversely, materials 
which normally exhibit a tendency to brittle behaviour 
in a tensile test will show ductility if tested under con- 
ditions of high shear stresses and low tension stresses. 
In commercial practice, extrusion approximates closely 
to a system of hydrostatic pressure, and it is common 
for normally brittle materials to exhibit some ductility 
when deformed in this way (e.g. when extruded). 

7.3.3 Relation of slip to crystal structure 

An understanding of the fundamental nature of plastic 
deformation processes is provided by experiments on 
single crystals only, because if a polycrystalline sample 
is used the result obtained is the average behaviour of 
all the differently oriented grains in the material. Such 
experiments with single crystals show that, although 
the resolved shear stress is a maximum along lines of 
greatest slope in planes at 45° to the tensile axis, slip 
occurs preferentially along certain crystal planes and 
directions. Three well-established laws governing the 
slip behaviour exist, namely: (1) the direction of slip 
is almost always that along which the atoms are most 
closely packed, (2) slip usually occurs on the most 
closely packed plane, and (3) from a given set of slip 
planes and directions, the crystal operates on that sys- 
tem (plane and direction) for which the resolved shear 
stress is largest. The slip behaviour observed in fee 
metals shows the general applicability of these laws, 
since slip occurs along (110) directions in {111} 
planes. In eph metals slip occurs along (1 12 0) direc- 
tions, since these are invariably the closest packed, 
but the active slip plane depends on the value of the 
axial ratio. Thus, for the metals cadmium and zinc, 
c/a is 1.886 and 1.856, respectively, the planes of 
greatest atomic density are the {0 001} basal planes 
and slip takes place on these planes. When the axial 
ratio is appreciably smaller than the ideal value of 
c/a = 1.633 the basal plane is not so closely packed, 
nor so widely spaced, as in cadmium and zinc, and 
other slip planes operate. In zirconium (c/a = 1.589) 
and titanium (c/a — 1.587), for example, slip takes 
place on the { 1 0 1 0} prism planes at room temperature 
and on the {101 1 } pyramidal planes at higher tem- 
peratures. In magnesium the axial ratio (c/a = 1.624) 
approximates to the ideal value, and although only 
basal slip occurs at room temperature, at temperatures 
above 225°C slip on the {1 0 1 1} planes has also been 
observed. Bcc metals have a single well-defined close- 
packed (111) direction, but several planes of equally 
high density of packing, i.e. {1 1 2}, {1 1 0} and {1 2 3}. 



The choice of slip plane in these metals is often influ- 
enced by temperature and a preference is shown for 
{112} below T m /4, {110} from T m / 4 to T m /2 and 
{12 3} at high temperatures, where T m is the melt- 
ing point. Iron often slips on all the slip planes at 
once in a common (111) slip direction, so that a 
slip line (i.e. the line of intersection of a slip plane 
with the outer surface of a crystal) takes on a wavy 
appearance. 

7.3.4 Law of critical resolved shear stress 

This law states that slip takes place along a given slip 
plane and direction when the shear stress reaches a 
critical value. In most crystals the high symmetry of 
atomic arrangement provides several crystallographic 
equivalent planes and directions for slip (i.e. cph crys- 
tals have three systems made up of one plane contain- 
ing three directions, fee crystals have twelve systems 
made up of four planes each with three directions, 
while bcc crystals have many systems) and in such 
cases slip occurs first on that plane and along that 
direction for which the maximum stress acts (law 3 
above). This is most easily demonstrated by testing in 
tension a series of zinc single crystals. Then, because 
zinc is cph in structure only one plane is available for 
the slip process and the resultant stress -strain curve 
will depend on the inclination of this plane to the 
tensile axis. The value of the angle 0 is determined 
by chance during the process of single-crystal growth, 
and consequently all crystals will have different values 
of 0, and the corresponding stress -strain curves will 
have different values of the flow stress as shown in 
Figure 7.11a. However, because of the criterion of a 
critical resolved shear stress, a plot of resolved shear 
stress (i.e. the stress on the glide plane in the glide 
direction) versus strain should be a common curve, 
within experimental error, for all the specimens. This 
plot is shown in Figure 7.11b. 

The importance of a critical shear stress may be 
demonstrated further by taking the crystal which has 


15 ° 
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its basal plane oriented perpendicular to the tensile 
axis, i.e. 0 = 0°, and subjecting it to a bend test. In 
contrast to its tensile behaviour, where it is brittle it 
will now appear ductile, since the shear stress on the 
slip plane is only zero for a tensile test and not for a 
bend test. On the other hand, if we take the crystal with 
its basal plane oriented parallel to the tensile axis (i.e. 
0 = 90°) this specimen will appear brittle whatever 
stress system is applied to it. For this crystal, although 
the shear force is large, owing to the large area of the 
slip plane, A/ cos 0, the resolved shear stress is always 
very small and insufficient to cause deformation by 
slipping. 

7.3.5 Multiple slip 

The fact that slip bands, each consisting of many slip 
lines, are observed on the surface of deformed crystals 
shows that deformation is inhomogeneous, with exten- 
sive slip occurring on certain planes, while the crystal 
planes lying between them remain practically unde- 
formed. Figures 7.12a and 7.12b show such a crystal 
in which the set of planes shear over each other in the 
slip direction. In a tensile test, however, the ends of 
a crystal are not free to move ‘sideways’ relative to 
each other, since they are constrained by the grips of 
the tensile machine. In this case, the central portion of 
the crystal is altered in orientation, and rotation of both 
the slip plane and slip direction into the axis of ten- 
sion occurs, as shown in Figure 7.12c. This behaviour 
is more conveniently demonstrated on a stereographic 
projection of the crystal by considering the rotation of 
the tensile axis relative to the crystal rather than vice 
versa. This is illustrated in Figure 7.13a for the defor- 
mation of a crystal with fee structure. The tensile axis, 
P, is shown in the unit triangle and the angles between 
P and [101], and P and (111) are equal to X and 0, 
respectively. The active slip system is the (1 1 1) plane 
and the [TO 1] direction, and as deformation proceeds 
the change in orientation is represented by the point, P, 



Figure 7.11 Schematic representation of (a) variation of stress versus elongation with orientation of basal plane and 
(b) constancy of revolved shear stress. 
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I direction j 

(b) (c) 

Figure 7.12 (a) and (b) show the slip process in an 
unconstrained single crystal; (c) illustrates the plastic 
bending in a crystal gripped at its ends. 



(a) 


Figure 7.13 Stereographic representation of (a) slip systems 
in fee crystals and (b) overshooting of the primary slip 
system. 

moving along the zone, shown broken in Figure 7.13a, 
towards [101], i.e. A decreasing and 0 increasing. 

As slip occurs on the one system, the primary sys- 
tem, the slip plane rotates away from its position of 
maximum resolved shear stress until the orientation of 
the crystal reaches the [001] — [1 1 1] symmetry line. 
Beyond this point, slip should occur equally on both 
the primary system and a second system (the conjugate 
system) (1 1 1) [01 1], since these two systems receive 
equal components of shear stress. Subsequently, during 
the process of multiple or duplex slip the lattice will 
rotate so as to keep equal stresses on the two active sys- 
tems, and the tensile axis moves along the symmetry 
line towards [1 12]. This behaviour agrees with early 


observations on virgin crystals of aluminium and cop- 
per, but not with those made on certain alloys, or pure 
metal crystals given special treatments (e.g. quenched 
from a high temperature or irradiated with neutrons). 
Results from the latter show that the crystal continues 
to slip on the primary system after the orientation has 
reached the symmetry line, causing the orientation to 
overshoot this line, i.e. to continue moving towards 
[101], in the direction of primary slip. After a certain 
amount of this additional primary slip the conjugate 
system suddenly operates, and further slip concen- 
trates itself on this system, followed by overshooting 
in the opposite direction. This behaviour, shown in 
Figure 7.13b, is understandable when it is remembered 
that slip on the conjugate system must intersect that on 
the primary system, and to do this is presumably more 
difficult than to ‘fit’ a new slip plane in the relatively 
undeformed region between those planes on which slip 
has already taken place. This intersection process is 
more difficult in materials which have a low stacking 
fault energy (e.g. a-brass). 

7.3.6 Relation between work-hardening and 
slip 

The curves of Figure 7.1 show that following the yield 
phenomenon a continual rise in stress is required to 
continue deformation, i.e. the flow stress of a deformed 
metal increases with the amount of strain. This resis- 
tance of the metal to further plastic flow as the defor- 
mation proceeds is known as work-hardening. The 
degree of work-hardening varies for metals of different 
crystal structure, and is low in hexagonal metal crys- 
tals such as zinc or cadmium, which usually slip on 
one family of planes only. The cubic crystals harden 
rapidly on working but even in this case when slip 
is restricted to one slip system (see the curve for 
specimen A, Figure 7.14) the coefficient of harden- 
ing, defined as the slope of the plastic portion of the 
stress -strain curve, is small. Thus this type of harden- 
ing, like overshoot, must be associated with the inter- 
action which results from slip on intersecting families 
of planes. This interaction will be dealt with more fully 
in Section 7.6.2. 
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Figure 7.14 Stress-strain curves for aluminium deformed 
by single and multiple slip (after Liicke and Lange, 1950). 
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(a) (b) 

Figure 7.15 Diagram showing structure of edge dislocation during gliding from (a) equilibrium to (b) metastable position. 


7.4 Dislocation behaviour during 
plastic deformation 
7.4.1 Dislocation mobility 

The ease with which crystals can be plastically 
deformed at stresses many orders of magnitude less 
than the theoretical strength (r t = fib/lna) is quite 
remarkable, and due to the mobility of dislocations. 
Figure 7.15a shows that as a dislocation glides through 
the lattice it moves from one symmetrical lattice 
position to another and at each position the dislocation 
is in neutral equilibrium, because the atomic forces 
acting on it from each side are balanced. As the 
dislocation moves from these symmetrical lattice 
positions some imbalance of atomic forces does 
exist, and an applied stress is required to overcome 
this lattice friction. As shown in Figure 7.15b, an 
intermediate displacement of the dislocation also leads 
to an approximately balanced force system. 

The lattice friction depends rather sensitively on the 
dislocation width w and has been shown by Peierls and 
Nabarro to be given by 

r ~ /x exp [— 2nw/b\ (7.5) 

for the shear of a rectangular lattice of interpla- 
nar spacing a with w = iib/2n(\ — v)r t = a(l — v). 
The friction stress is therefore often referred to as 
the Peierls -Nabarro stress. The two opposing factors 
affecting w are (1) the elastic energy of the crystal, 
which is reduced by spreading out the elastic strains, 
and (2) the misfit energy, which depends on the num- 
ber of misaligned atoms across the slip plane. Metals 
with close-packed structures have extended disloca- 
tions and hence w is large. Moreover, the close-packed 
planes are widely spaced with weak alignment forces 
between them (i.e. have a small b/a factor). These 
metals have highly mobile dislocations and are intrin- 
sically soft. In contrast, directional bonding in crystals 
tends to produce narrow dislocations, which leads to 
intrinsic hardness and brittleness. Extreme examples 
are ionic and ceramic crystals and the covalent mate- 
rials such as diamond and silicon. The bcc transition 
metals display intermediate behaviour (i.e. intrinsically 
ductile above room temperatures but brittle below). 


Direct measurements of dislocation velocity v have 
now been made in some crystals by means of the etch 
pitting technique; the results of such an experiment are 
shown in Figure 7.16. Edge dislocations move faster 
than screws, because of the frictional drag of jogs on 
screws, and the velocity of both varies rapidly with 



Figure 7.16 Stress dependence of the velocity of edge and 
screw dislocations in lithium fluoride (from Johnston and 
Gilman, 1959; courtesy of the American Institute of Physics). 
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(b) [1 0 8 N/m 2 ] Stress 

Figure 7.17 (a) Correlation between stress to cause dislocation motion and the macro-yield stresses of crystals, (b) Edge 
dislocation motions in Fe-3% Si crystals ( after Stein and Low, 1960; courtesy of the American Institute of Physics). 


applied stress r according to an empirical relation of 
the form v = (r/r 0 ) n , where r 0 is the stress for unit 
speed and n is an index which varies for different 
materials. At high stresses the velocity may approach 
the speed of elastic waves ~10 3 m/s. The index n is 
usually low (<10) for intrinsically hard, covalent crys- 
tals such as Ge, ~40 for bcc crystals, and high (~200) 
for intrinsically soft fee crystals. It is observed that a 
critical applied stress is required to start the disloca- 
tions moving and denotes the onset of microplasticity. 
A macroscopic tensile test is a relatively insensitive 
measure of the onset of plastic deformation and the 
yield or flow stress measured in such a test is related 
not to the initial motion of an individual dislocation but 
to the motion of a number of dislocations at some finite 
velocity, e.g. ~10 nm/s as shown in Figure 17.17a. 
Decreasing the temperature of the test or increasing the 
strain-rate increases the stress level required to produce 
the same finite velocity (see Figure 7.17b), i.e. dis- 
placing the velocity- stress curve to the right. Indeed, 
hardening the material by any mechanism has the 
same effect on the dislocation dynamics. This obser- 
vation is consistent with the increase in yield stress 
with decreasing temperature or increasing strain-rate. 
Most metals and alloys are hardened by cold working 
or by placing obstacles (e.g. precipitates) in the path 


of moving dislocations to hinder their motion. Such 
strengthening mechanisms increase the stress neces- 
sary to produce a given finite dislocation velocity in a 
similar way to that found by lowering the temperature. 

7.4.2 Variation of yield stress with 
temperature and strain rate 

The high Peierls-Nabarro stress, which is associated 
with materials with narrow dislocations, gives rise to a 
short-range barrier to dislocation motion. Such barriers 
are effective only over an atomic spacing or so, hence 
thermal activation is able to aid the applied stress in 
overcoming them. Thermal activation helps a portion 
of the dislocation to cross the barrier after which glide 
then proceeds by the sideways movement of kinks. 
(This process is shown in Figure 7.29, Section 7.4.8.) 
Materials with narrow dislocations therefore exhibit 
a significant temperature- sensitivity; intrinsically hard 
materials rapidly lose their strength with increasing 
temperature, as shown schematically in Figure 7.18a. 
In this diagram the (yield stress/modulus) ratio is plot- 
ted against T/T m to remove the effect of modulus 
which decreases with temperature. Figure 7.18b shows 
that materials which exhibit a strong temperature- 
dependent yield stress also exhibit a high strain-rate 
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Figure 7.18 Variation of yield stress with (a) temperature, (b) 
(iv) covalent-bonded structure. 

sensitivity, i.e. the higher the imposed strain rate, the 
higher the yield stress. This arises because thermal 
activation is less effective at the faster rate of defor- 
mation. 

In bcc metals a high lattice friction to the move- 
ment of a dislocation may arise from the dissocia- 
tion of a dislocation on several planes. As discussed 
in Chapter 4, when a screw dislocation with Burgers 
vector a 1 2[1 1 1] lies along a symmetry direction it 
can dissociate on three crystallographically equivalent 
planes. If such a dissociation occurs, it will be nec- 
essary to constrict the dislocation before it can glide 
in any one of the slip planes. This constriction will 
be more difficult to make as the temperature is low- 
ered so that the large temperature dependence of the 
yield stress in bcc metals, shown in Figure 7.18a and 
also Figure 7.30, may be due partly to this effect. In 
fee metals the dislocations lie on {111} planes, and 
although a dislocation will dissociate in any given 
(111) plane, there is no direction in the slip plane 
along which the dislocation could also dissociate on 
other planes; the temperature-dependence of the yield 
stress is small as shown in Figure 7.18a. In eph metals 
the dissociated dislocations moving in the basal plane 
will also have a small Peierls force and be glissile with 
low temperature-dependence. However, screw dislo- 
cations moving on non-basal planes (i.e. prismatic 
and pyramidal planes) may have a high Peierls force 
because they are able to extend in the basal plane as 
shown in Figure 7.19. Hence, constrictions will once 
again have to be made before the screw dislocations 
can advance on non-basal planes. This effect con- 
tributes to the high critical shear stress and strong 
temperature-dependence of non-basal glide observed 
in this crystal system, as mentioned in Chapter 4. 

7.4.3 Dislocation source operation 

When a stress is applied to a material the specimen 
plastically deforms at a rate governed by the strain rate 
of the deformation process (e.g. tensile testing, rolling, 
etc.) and the strain rate imposes a particular velocity 
on the mobile dislocation population. In a crystal of 
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(b) 
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Figure 7.19 Dissociation in the basal plane of a screw 
dislocation moving on a non-basal glide plane. 



Figure 7.20 Shear produced by gliding dislocations. 

dimensions L\ x L 2 x 1 cm shown in Figure 7.20 a 
dislocation with velocity v moves through the crystal 
in time t = L\/v and produces a shear strain b/Li, i.e. 
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Figure 7.21 Successive stages in the operation of a Frank-Read source. The plane of the paper is assumed to be the slip plane. 


the strain rate is bv/L \L 2 . If the density of glissible 
dislocations is p, the total number of dislocations 
which become mobile in the crystal is pL\L 2 and the 
overall strain rate is thus given by 


y 


b v 

uu 


pL\L 2 = pbv 


(7.6) 


At conventional strain rates (e.g. Is -1 ) the disloca- 
tions would be moving at quite moderate speeds of 
a few cm/s if the mobile density ~10 7 /cm 2 . During 
high-speed deformation the velocity approaches the 
limiting velocity. The shear strain produced by these 
dislocations is given by 

y = pbx (7.7) 


where x is the average distance a dislocation moves. 
If the distance x ~ 10 -4 cm (the size of an average 
sub-grain) the maximum strain produced by p ~ 10 7 
is about (10 7 x 3 x 10 -8 x 10 -4 ) which is only a 
fraction of 1%. In practice, shear strains >100% can 
be achieved, and hence to produce these large strains 
many more dislocations than the original ingrown 
dislocations are required. To account for the increase 
in number of mobile dislocations during straining the 
concept of a dislocation source has been introduced. 
The simplest type of source is that due to Frank and 
Read and accounts for the regenerative multiplication 
of dislocations. A modified form of the Frank-Read 
source is the multiple cross-glide source, first proposed 
by Koehler, which, as the name implies, depends on 
the cross-slip of screw dislocations and is therefore 
more common in metals of intermediate and high 
stacking fault energy. 

Figure 7.21 shows a Frank-Read source consisting 
of a dislocation line fixed at the nodes A and B 
(fixed, for example, because the other dislocations that 
join the nodes do not lie in slip planes). Because 
of its high elastic energy (~4 eV per atom plane 
threaded by a dislocation) the dislocation possesses a 
line tension tending to make it shorten its length as 
much as possible (position 1, Figure 7.21). This line 
tension T is roughly equal to apib 2 , where pi is the 
shear modulus, b the Burgers vector and a a constant 
usually taken to be about Under an applied stress 
the dislocation line will bow out, decreasing its radius 
of curvature until it reaches an equilibrium position in 
which the line tension balances the force due to the 
applied stress. Increasing the applied stress causes the 


line to decrease its radius of curvature further until it 
becomes semi-circular (position 2). Beyond this point 
it has no equilibrium position so it will expand rapidly, 
rotating about the nodes and taking up the succession 
of forms indicated by 3, 4 and 5. Between stages 4 
and 5 the two parts of the loop below AB meet and 
annihilate each other to form a complete dislocation 
loop, which expands into the slip plane and a new 
line source between A and B. The sequence is then 
repeated and one unit of slip is produced by each loop 
that is generated. 

To operate the Frank- Read source the force applied 
must be sufficient to overcome the restoring force on 
the dislocation line due to its line tension. Referring to 
Figure 7.22 this would be 2TdO/2 > rbldO/2, and if 
T ~ ptb 2 / 2 the stress to do this is about ptb/l, where 
pi and b have their usual meaning and / is the length 
of the Frank-Read source; the substitution of typical 
values (pi = 4 x 10 10 N m -2 , b = 2.5 x 10 -10 m, and 
/ = 10 -6 m) into this estimate shows that a critical 
shear stress of about 100 gf mm -2 is required. This 
value is somewhat less than but of the same order as 
that observed for the yield stress of virgin pure metal 
single crystals. Another source mechanism involves 
multiple cross-slip as shown in Figure 7.23. It depends 


rb 



Figure 7.22 Geometry of Frank-Read source used to 
calculate the stress to operate. 
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on the Frank-Read principle but does not require a 
dislocation segment to be anchored by nodes. Thus, if 
part of a moving screw dislocation undergoes double 
cross-slip the two pieces of edge dislocation on the 
cross-slip plane effectively act as anchoring points 
for a new source. The loop expanding on the slip 
plane parallel to the original plane may operate as 
a Frank-Read source and any loops produced may 
in turn cross slip and become a source. This process 
therefore not only increases the number of dislocations 
on the original slip plane but also causes the slip band 
to widen. 

The concept of the dislocation source accounts 
for the observation of slip bands on the surface of 
deformed metals. The amount of slip produced by 
the passage of a single dislocation is too small to be 
observable as a slip line or band under the light micro- 
scope. To be resolved it must be at least 300 nm in 
height and hence ~1000 dislocations must have oper- 
ated in a given slip band. Moreover, in general, the slip 
band has considerable width, which tends to support 
the operation of the cross-glide source as the predom- 
inant mechanism of dislocation multiplication during 
straining. 

7.4.4 Discontinuous yielding 

In some materials the onset of macroscopic plastic 
flow begins in an abrupt manner with a yield drop 
in which the applied stress falls, during yielding, from 
an upper to a lower yield point. Such yield behaviour 
is commonly found in iron containing small amounts 
of carbon or nitrogen as impurity. The main char- 
acteristics of the yield phenomenon in iron may be 
summarized as follows. 

Yield point A specimen of iron during tensile defor- 
mation (Figure 7.24a, curve 1) behaves elastically up 



to a certain high load A, known as the upper yield 
point, and then it suddenly yields plastically. The 
important feature to note from this curve is that the 
stress required to maintain plastic flow immediately 
after yielding has started is lower than that required to 
start it, as shown by the fall in load from A to B (the 
lower yield point). A yield point elongation to C then 
occurs after which the specimen work hardens and the 
curve rises steadily and smoothly. 

Overstraining The yield point can be removed tem- 
porarily by applying a small preliminary plastic strain 
to the specimen. Thus, if after reaching the point D, for 
example, the specimen is unloaded and a second test is 
made fairly soon afterwards, a stress -strain curve of 
type 2 will be obtained. The specimen deforms elasti- 
cally up to the unloading point, D, and the absence of 
a yield point at the beginning of plastic flow is char- 
acteristic of a specimen in an overstrained condition. 

Strain-age hardening If a specimen which has been 
overstrained to remove the yield point is allowed to 
rest, or age, before retesting, the yield point returns as 
shown in Figure 7.24a, curve 3. This process, which is 
accompanied by hardening (as shown by the increased 
stress, EF, to initiate yielding) is known as strain- 
ageing or, more specifically, strain-age hardening. In 
iron, strain-ageing is slow at room temperature but is 
greatly speeded up by annealing at a higher tempera- 
ture. Thus, a strong yield point returns after an ageing 
treatment of only a few seconds at 200° C, but the same 
yield point will take many hours to develop if ageing 
is carried out at room temperature. 

Liiders band formation Closely related to the yield 
point is the formation of Liiders bands. These bands 
are markings on the surface of the specimen which 
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Figure 7.24 Schematic representation of (a) strain ageing and (h) Liiders band formation. 
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distinguish those parts of the specimen that have 
yielded, A, from those which have not, B. Arrival at 
the upper yield point is indicated by the formation 
of one or more of these bands and as the specimen 
passes through the stage of the yield point elongation 
these bands spread along the specimen and coalesce 
until the entire gauge length has been covered. At this 
stage the whole of the material within the gauge length 
has been overstrained, and the yield point elongation 
is complete. The growth of a Luders band is shown 
diagrammatically in Figure 7.24b. It should be noted 
that the band is a macroscopic band crossing all 
the grains in the cross-section of a poly crystalline 
specimen, and thus the edges of the band are not 
necessarily the traces of individual slip planes. A 
second point to observe is that the rate of plastic flow 
in the edges of a band can be very high even in an 
apparently slow test; this is because the zones, marked 
C in Figure 7.24b, are very narrow compared with the 
gauge length. 

These Luders bands frequently occur in drawing 
and stamping operations when the surface markings 
in relief are called stretcher strains. These markings 
are unsightly in appearance and have to be avoided 
on many finished products. The remedy consists in 
overstraining the sheet prior to pressing operations, by 
means of a temper roll, or roller levelling, pass so 
that the yield phenomenon is eliminated. It is essential, 
once this operation has been performed, to carry out 
pressing before the sheet has time to strain-age; the 
use of a 4 non- ageing’ steel is an alternative remedy. 

These yielding effects are influenced by the pres- 
ence of small amounts of carbon or nitrogen atoms 
interacting with dislocations. The yield point can 
be removed by annealing at 700° C in wet-hydrogen 
atmosphere, and cannot subsequently be restored by 
any strain-ageing treatment. Conversely, exposing the 
decarburized specimen to an atmosphere of dry hydro- 
gen containing a trace of hydrocarbon at 700° C for as 
little as one minute restores the yield point. The carbon 
and nitrogen atoms can also be removed from solu- 
tion in other ways: for example, by adding to the iron 
such elements as molybdenum, manganese, chromium, 
vanadium, niobium or titanium which have a strong 
affinity for forming carbides or nitrides in steels. For 
this reason, these elements are particularly effective in 
removing the yield point and producing a non- strain 
ageing steel. 

The carbon/nitrogen atoms are important in yielding 
process because they interact with the dislocations and 
immobilize them. This locking of the dislocations is 
brought about because the strain energy due to the 
distortion of a solute atom can be relieved if it fits into 
a structural region where the local lattice parameter 
approximates to that of the natural lattice parameter of 
the solute. Such a condition will be brought about by 
the segregation of solute atoms to the dislocations, with 
large substitutional atoms taking up lattice positions in 
the expanded region, and small ones in the compressed 
region; small interstitial atoms will tend to segregate to 


interstitial sites below the half-plane. Thus, where both 
dislocations and solute atoms are present in the lattice, 
interactions of the stress field can occur, resulting in 
a lowering of the strain energy of the system. This 
provides a driving force tending to attract solute atoms 
to dislocations and if the necessary time for diffusion is 
allowed, a solute atom ‘atmosphere’ will form around 
each dislocation. 

When a stress is applied to a specimen in which the 
dislocations are locked by carbon atoms the disloca- 
tions are not able to move at the stress level at which 
free dislocations are normally mobile. With increas- 
ing stress yielding occurs when dislocations suddenly 
become mobile either by breaking away from the car- 
bon atmosphere or by nucleating fresh dislocations 
at stress concentrations. At this high stress level the 
mobile dislocation density increases rapidly. The lower 
yield stress is then the stress at which free dislocations 
continue to move and produce plastic flow. The over- 
strained condition corresponds to the situation where 
the mobile dislocations, brought to rest by unloading 
the specimen, are set in motion again by reloading 
before the carbon atmospheres have time to develop 
by diffusion. If, however, time is allowed for diffu- 
sion to take place, new atmospheres can re-form and 
immobilize the dislocations again. This is the strain- 
aged condition when the original yield characteristics 
reappear. 

The upper yield point in conventional experiments 
on polycrystalline materials is the stress at which 
initially yielded zones trigger yield in adjacent grains. 
As more and more grains are triggered the yield zones 
spread across the specimen and form a Luders band. 

The propagation of yield is thought to occur when a 
dislocation source operates and releases an avalanche 
of dislocations into its slip plane which eventually pile 
up at a grain boundary or other obstacle. The stress 
concentration at the head of the pile-up acts with the 
applied stress on the dislocations of the next grain 
and operates the nearest source, so that the process 
is repeated in the next grain. The applied shear stress 
G y at which yielding propagates is given by 

G y = G t + (<7 c r 1/2 )d _1/2 (7.8) 

where r is the distance from the pile-up to the nearest 
source, 2d is the grain diameter and g c is the stress 
required to operate a source which involves unpinning 
a dislocation r c at that temperature. Equation (7.8) 
reduces to the Hall-Petch equation G y = a* + k y d~ l/2 , 
where G t is the ‘friction’ stress term and k y the grain 
size dependence parameter (= m 2 r c r 1/2 ) discussed in 
Section 7.4.11. 

7.4.5 Yield points and crystal structure 

The characteristic feature of discontinuous yielding 
is that at the yield point the specimen goes from 
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a condition where the availability of mobile dislo- 
cations is limited to one where they are in abun- 
dance, the increase in mobile density largely aris- 
ing from dislocation multiplication at the high stress 
level. A further feature is that not all the dislo- 
cations have to be immobilized to observe a yield 
drop. Indeed, this is not usually possible because 
specimen handling, non-axial loading, scratches, etc. 
give rise to stress concentrations that provide a small 
local density of mobile dislocations (i.e. pre-yield 
microtrain). 

For materials with a high Peierls-Nabarro (P-N) 
stress, yield drops may be observed even when they 
possess a significant mobile dislocation density. A 
common example is that observed in silicon; this is 
an extremely pure material with no impurities to lock 
dislocations, but usually the dislocation density is quite 
modest (10 7 m/m 3 ) and possesses a high P-N stress. 

When these materials are pulled in a tensile test 
the overall strain rate y imposed on the specimen 
by the machine has to be matched by the motion of 
dislocations according to the relation y = pbv. How- 
ever, because p is small the individual dislocations 
are forced to move at a high speed v, which is only 
attained at a high stress level (the upper yield stress) 
because of the large P-N stress. As the dislocations 
glide at these high speeds, rapid multiplication occurs 
and the mobile dislocation density increases rapidly. 
Because of the increased value of the term pv, a lower 
average velocity of dislocations is then required to 
maintain a constant strain rate, which means a lower 
glide stress. The stress that can be supported by the 
specimen thus drops during initial yielding to the lower 
yield point, and does not rise again until the disloca- 
tion-dislocation interactions caused by the increased 
p produce a significant work-hardening. 

In the fee metals, the P-N stress is quite small and 
the stress to move a dislocation is almost indepen- 
dent of velocity up to high speeds. If such metals are 
to show a yield point, the density of mobile disloca- 
tions must be reduced virtually to zero. This can be 
achieved as shown in Figure 7.25 by the tensile test- 
ing of whisker crystals which are very perfect. Yielding 
begins at the stress required to create dislocations in the 
perfect lattice, and the upper yield stress approaches 
the theoretical yield strength. Following multiplica- 
tion, the stress for glide of these dislocations is several 
orders of magnitude lower. 

Bcc transition metals such as iron are intermediate 
in their plastic behaviour between the fee metals and 
diamond cubic Si and Ge. Because of the significant 
P-N stress these bcc metals are capable of exhibit- 
ing a sharp yield point even when the initial mobile 
dislocation density is not zero, as shown by the cal- 
culated curves of Figure 7.26. However, in practice, 
the dislocation density of well-annealed pure metals 
is about 10 10 m/m 3 and too high for any significant 
yield drop without an element of dislocation locking 
by carbon atoms. 



Figure 7.25 Yield point in a copper whisker. 
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Figure 7.26 Calculated stress-strain curves showing 
influence of initial dislocation density on the yield drop in 
iron for n = 35 with (i) 10 1 cm~ 2 , (ii) 10 3 cm~ 2 , 

(iii) 10 5 cm~ 2 and (iv) 10 7 cm~ 2 (after Hahn, 1962; 
courtesy of Pergamon Press). 


It is evident that discontinuous yielding can be 
produced in all the common metal structures provided 
the appropriate solute elements are present, and cor- 
rect testing procedure adopted. The effect is particu- 
larly strong in the bcc metals and has been observed 
in of-iron, molybdenum, niobium, vanadium and j$- 
brass each containing a strongly interacting interstitial 
solute element. The hexagonal metals (e.g. cadmium 
and zinc) can also show the phenomenon provided 
interstitial nitrogen atoms are added. The copper- 
and aluminium-based fee alloys also exhibit yielding 
behaviour but often to a lesser degree. In this case it is 
substitutional atoms (e.g. zinc in a-brass and copper in 
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aluminium alloys) which are responsible for the phe- 
nomenon (see Section 7.4.7). 

7.4.6 Discontinuous yielding in ordered alloys 

Discontinuous yield points have been observed in a 
wide variety of A 3 B-type alloys. Figure 7.27 shows 
the development of the yield point in Ni 3 Fe on ageing. 
The addition of A1 speeds up the kinetics of ordering 
and therefore the onset of the yield point. Ordered 
materials deform by superdislocation motion and the 
link between yield points and superdislocations is con- 
firmed by the observation that in Cu 3 Au, for example, 
a transition from groups of single dislocations to more 
randomly arranged superdislocation pairs takes place 
at = 0.7 (see Chapter 4) and this coincides with 
the onset of a large yield drop and rapid rise in work 
hardening. 

Sharp yielding may be explained by at least two 
mechanisms, namely (1) cross-slip of the superdislo- 
cation onto the cube plane to lower the APB energy 
effectively pinning it and (2) dislocation locking by 
rearrangement of the APB on ageing. The shear APB 
between a pair of superdislocations is likely to be 
energetically unstable since there are many like bonds 
across the interface and thermal activation will mod- 
ify this sharp interface by atomic rearrangement. This 


APB -locking model will give rise to sharp yield- 
ing because the energy required by the lead dislo- 
cation in creating sharp APB is greater than that 
released by the trailing dislocation initially moving 
across diffuse APB. Experimental evidence favours 
the APB -model and weak-beam electron microscopy 
(see Figure 7.28) shows that the superdislocation sep- 
aration for a shear APB corresponds to an energy of 
48 =b 5 mJ/m 2 , whereas a larger dislocation separation 
corresponding to an APB energy of 25 =b 3 mJ/m 2 was 
observed for a strained and aged Cu 3 Au. 

7.4.7 Solute -dislocation interaction 

Iron containing carbon or nitrogen shows very marked 
yield point effects and there is a strong elastic interac- 
tion between these solute atoms and the dislocations. 
The solute atoms occupy interstitial sites in the lat- 
tice and produce large tetragonal distortions as well 
as large- volume expansions. Consequently, they can 
interact with both shear and hydrostatic stresses and 
can lock screw as well as edge dislocations. Strong 
yielding behaviour is also expected in other bcc met- 
als, provided they contain interstitial solute elements. 
On the other hand, in the case of fee metals the 
arrangement of lattice positions around either intersti- 
tial or substitutional sites is too symmetrical to allow a 
solute atom to produce an asymmetrical distortion, and 



Figure 7.27 Development of a yield point with ageing at 490° C for the times indicated, (a) Ni^Fe, (b) Ni^Fe + 5% Al; the 
tests are at room temperature. 
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Figure 7.28 Weak-beam micrographs showing separation of superdislocation partials in Cu^Au. (a) As deformed, (b) after 
ageing at 225 °C ( after Morris and Smallman, 1975). 


the atmosphere locking of screw dislocations, which 
requires a shear stress interaction, would appear to be 
impossible. Then by this argument, since the screw 
dislocations are not locked, a drop in stress at the 
yield point should not be observed. Nevertheless, yield 
points are observed in fee materials and one reason 
for this is that unit dislocations in fee metals dissoci- 
ate into pairs of partial dislocations which are elasti- 
cally coupled by a stacking fault. Moreover, since their 
Burgers vectors intersect at 120° there is no orienta- 
tion of the line of the pair for which both can be pure 
screws. At least one of them must have a substantial 
edge component, and a locking of this edge component 
by hydrostatic interactions should cause a locking of 
the pair although it will undoubtedly be weaker. 

In its quantitative form the theory of solute atom 
locking has been applied to the formation of an 
atmosphere around an edge dislocation due to hydro- 
static interaction. Since hydrostatic stresses are scalar 
quantities, no knowledge is required in this case of 
the orientation of the dislocation with respect to the 
interacting solute atom, but it is necessary in calcu- 
lating shear stresses interactions. 1 Cottrell and Bilby 
have shown that if the introduction of a solute atom 
causes a volume change Av at some point in the lattice 
where the hydrostatic pressure of the stress field is /?, 
the interaction energy is 

V = pAv = K® Av (7.9) 

where K is the bulk modulus and 0 is the local dilata- 
tion strain. The dilatation strain at a point ( R , 0) from a 
positive edge dislocation is b( 1 — 2v) x sin0/27ri?(l — 

! To a first approximation a solute atom does not interact 
with a screw dislocation since there is no dilatation around 
the screw; a second-order dilatation exists however, which 
gives rise to a non-zero interaction falling off with distance 
from the dislocation according to 1/r 2 . In real crystals, 
anisotropic elasticity will lead to first-order size effects even 
with screw dislocations and hence a substantial interaction 
is to be expected. 


v), and substituting K = 2/z(l + v)/3(l — 2v), where 
/z is the shear modulus and v Poisson’s ratio, we get 
the expression 

V (r,o) = b( 1 + v)/xAv sm0/3nR(l — v) 

= AsinO/R (7.10) 

This is the interaction energy at a point whose polar 
coordinates with respect to the centre of the dislocation 
are R and 0. We note that V is positive on the upper 
side (0 < 6 < n) of the dislocation for a large atom 
(Av > 0), and negative on the lower side, which 
agrees with the qualitative picture of a large atom being 
repelled from the compressed region and attracted into 
the expanded one. 

It is expected that the site for the strongest bind- 
ing energy V max will be at a point 0 = 3n/ 2, R = 
r 0 — b; and using known values of pi, v and Av in 
equation (7.10) we obtain A~3x 10~ 29 N m 2 and 
V max — 1 eV for carbon or nitrogen in o'-iron. This 
value is almost certainly too high because of the limi- 
tations of the interaction energy equation in describing 
conditions near the centre of a dislocation, and a more 
realistic value obtained from experiment (e.g. internal 
friction experiments) is V max — \ to | eV. For a sub- 
stitutional solute atom such as zinc in copper Av is 
not only smaller but also easier to calculate from lat- 
tice parameter measurements. Thus, if r and r( 1 + e) 
are the atomic radii of the solvent and solute, respec- 
tively, where s is the misfit value, the volume change 
Av is 4nr 3 £ and equation (7.10) becomes 

V = 4(1 + v) fiber 3 sin 0/3(1 — v)R 

= Asin0/R (7.11) 

Taking the known values [i — 40 GN/m 2 , v = 0.36, 
b = 2.55 x 10 -10 m, r 0 and s = 0.06, we find A ~ 
5 x 10- 30 N m 2 , which gives a much lower binding 
energy, V max = | eV. 

The yield phenomenon is particularly strong in iron 
because an additional effect is important; this concerns 
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the type of atmosphere a dislocation gathers round 
itself which can be either condensed or dilute. Dur- 
ing the strain-ageing process migration of the solute 
atoms to the dislocation occurs and two important 
cases arise. First, if all the sites at the centre of the 
dislocation become occupied the atmosphere is then 
said to be condensed; each atom plane threaded by the 
dislocation contains one solute atom at the position 
of maximum binding together with a diffuse cloud of 
other solute atoms further out. If, on the other hand, 
equilibrium is established before all the sites at the 
centre are saturated, a steady state must be reached 
in which the probability of solute atoms leaving the 
centre can equal the probability of their entering it. 
The steady- state distribution of solute atoms around 
the dislocations is then given by the relation 

C(R,e) — c o exp [V(R,o)/kT] 

where Co is the concentration far from a dislocation, k 
is Boltzmann’s constant, T is the absolute temperature 
and c the local impurity concentration at a point near 
the dislocation where the binding energy is V. This 
is known as the dilute or Maxwellian atmosphere. 
Clearly, the form of an atmosphere will be governed 
by the concentration of solute atoms at the sites of 
maximum binding energy, F max and for a given alloy 
(i.e. c 0 and V max fixed) this concentration will be 

<V max = c 0 exp (V ma JkT) (7.12) 

as long as cy max is less than unity. The value of c Vmax 
depends only on the temperature, and as the temper- 
ature is lowered c Vmax will eventually rise to unity. 
By definition the atmosphere will then have passed 
from a dilute to a condensed state. The temperature at 
which this occurs is known as the condensation tem- 
perature T c , and can be obtained by substituting the 
value cy max = 1 in equation (7.12) when 

T c = F max /kln(l/c 0 ) (7.13) 

Substituting the value of V max for iron, i.e. ^ eV in this 
equation we find that only a very small concentration 
of carbon or nitrogen is necessary to give a condensed 
atmosphere at room temperature, and with the usual 
concentration strong yielding behaviour is expected up 
to temperatures of about 400° C. 

In the fee structure although the locking between a 
solute atom and a dislocation is likely to be weaker, 
condensed atmospheres are still possible if this weak- 
ness can be compensated for by sufficiently increasing 
the concentration of the solution. This may be why 
examples of yielding in fee materials have been mainly 
obtained from alloys. Solid solution alloys of alu- 
minium usually contain less than 0.1 at. % of solute 
element, and these show yielding in single crystals 
only at low temperature (e.g. liquid nitrogen tempera- 
ture, — 196°C) whereas supersaturated alloys show evi- 
dence of strong yielding even in polycrystals at room 
temperature; copper dissolved in aluminium has a mis- 
fit value £ — 0. 12 which corresponds to V max = | eV, 


and from equation (7.13) it can be shown that a 0.1 at. 
% alloy has a condensation temperature T c = 250 K. 
Copper-based alloys, on the other hand, usually form 
extensive solid solutions, and, consequently, concen- 
trated alloys may exhibit strong yielding phenomena. 

The best-known example is a-brass and, because 
V max — | eV, a dilute alloy containing 1 at. % zinc 
has a condensation temperature T c ~ 300 K. At low 
zinc concentrations (1-10%) the yield point in brass 
is probably solely due to the segregation of zinc atoms 
to dislocations. At higher concentrations, however, it 
may also be due to short-range order. 

7.4.8 Dislocation locking and temperature 

The binding of a solute atom to a dislocation is short 
range in nature, and is effective only over an atomic 
distance or so (Figure 7.29). Moreover, the dislocation 
line is flexible and this enables yielding to begin by 
throwing forward a small length of dislocation line, 
only a few atomic spacings long, beyond the position 
marked x 2 . The applied stress then separates the rest 
of the dislocation line from its anchorage by pulling 
the sides of this loop outward along the dislocation 
line, i.e. by double kink movement. Such a breakaway 
process would lead to a yield stress which depends 
sensitively on temperature, as shown in Figure 7.30a. 
It is observed, however, that k y , the grain- size depen- 
dence parameter in the Hall-Petch equation, in most 
annealed bcc metals is almost independent of tem- 
perature down to the range (<100 K) where twinning 
occurs, and that practically all the large temperature- 
dependence is due to Gi (see Figure 7.30b). To explain 
this observation it is argued that when locked disloca- 
tions exist initially in the material, yielding starts by 
unpinning them if they are weakly locked (this corre- 
sponds to the condition envisaged by Cottrell -Bilby), 
but if they are strongly locked it starts instead by 



Figure 7.29 Stress -displacement curve for the breakaway 
of a dislocation from its atmosphere ( after Cottrell, 1957; 
courtesy of the Institution of Mechanical Engineers ). 
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Figure 7.30 Variation of lower yield stress with (a) temperature and (b) grain size, for low -carbon steel and niobium; the 
curve for nickel is shown in (a) for comparison (after Adams, Roberts and Smallman, 1960; Hull and Mogford, 1958). 


the creation of new dislocations at points of stress 
concentration. This is an athermal process and thus 
k y is almost independent of temperature. Because of 
the rapid diffusion of interstitial elements the con- 
ventional annealing and normalizing treatments should 
commonly produce strong locking. In support of this 
theory, it is observed that k y is dependent on tempera- 
ture in the very early stages of ageing following either 
straining or quenching but on subsequent ageing k y 
becomes temperature-independent. The interpretation 
of k y therefore depends on the degree of ageing. 

Direct observations of crystals that have yielded 
show that the majority of the strongly anchored dis- 
locations remain locked and do not participate in the 
yielding phenomenon. Thus large numbers of dislo- 
cations are generated during yielding by some other 
mechanism than breaking away from Cottrell atmo- 
spheres, and the rapid dislocation multiplication, which 
can take place at the high stress levels, is now con- 
sidered the most likely possibility. Prolonged ageing 
tends to produce coarse precipitates along the disloca- 
tion line and unpinning by bowing out between them 
should easily occur before grain boundary creation. 
This unpinning process would also give k y independent 
of temperature. 

7.4.9 Inhomogeneity interaction 

A different type of elastic interaction can exist which 
arises from the different elastic properties of the 


solvent matrix and the region near a solute. Such an 
inhomogeneity interaction has been analysed for both 
a rigid and a soft spherical region; the former cor- 
responds to a relatively hard impurity atom and the 
latter to a vacant lattice site. The results indicate that 
the interaction energy is of the form B/r 2 where B is a 
constant involving elastic constants and atomic size. It 
is generally believed that the inhomogeneity effect is 
small for solute -dislocation interactions but dominates 
the size effect for vacancy -dislocation interaction. The 
kinetics of ageing support this conclusion. 

7.4.10 Kinetics of strain-ageing 

Under a force F an atom migrating by thermal agi- 
tation acquires a steady drift velocity v = DF /kT (in 
addition to its random diffusion movements) in the 
direction of the F, where D is the coefficient of diffu- 
sion. The force attracting a solute atom to a dislocation 
is the gradient of the interaction energy dV/dr and 
hence v = (D/kT)(A/r 2 ). Thus atoms originally at a 
distance r from the dislocation reach it in a time given 
approximately by 

t = r/v = r 3 kr /AD 

After this time t the number of atoms to reach unit 
length of dislocation is 

n(t) = nr 2 Co = ttcq[(AD / kT)t ] 2/3 
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where cq is the solute concentration in uniform solution 
in terms of the number of atoms per unit volume. If 
p is the density of dislocations (cm/cm 3 ) and / the 
fraction of the original solute which has segregated to 
the dislocation in time t then, 

/ = np[(AD/kT)r\ 2 / 3 (7.14) 

This expression is valid for the early stages of ageing, 
and may be modified to fit the later stages by allowing 
for the reduction in the matrix concentration as ageing 
proceeds, such that the rate of flow is proportional to 
the amount left in the matrix, 

df/dt = 7tp(AD/kT) 2/3 (2/3 )r 2/3 ( 1 - /) 
which when integrated gives 

/ = 1 - exp {-7tp[(AD/kT)t] l/3 } (7.15) 

This reduces to the simpler equation (7.14) when the 
exponent is small, and is found to be in good agree- 
ment with the process of segregation and precipitation 
on dislocations in several bcc metals. For carbon in 
of-Fe, Harper determined the fraction of solute atom 
still in solution using an internal friction technique 
and showed that log (1 — /) is proportional to t 2/ 3 ; 
the slope of the line is np(AD/kT) and evaluation of 
this slope at a series of temperatures allows the acti- 
vation energy for the process to be determined from 
an Arrhenius plot. The value obtained for o'-iron is 
84 kJ/mol which is close to that for the diffusion of 
carbon in ferrite. 

The inhomogeneity interaction is considered to be 
the dominant effect in vacancy -dislocation interac- 
tions, with V = — B/r 2 where B is a constant; this com- 
pares with the size effect for which V = —A/r would 
be appropriate for the interstitial -dislocation interac- 
tion. It is convenient, however, to write the interaction 
energy in the general form V = —A/r n and hence, fol- 
lowing the treatment previously used for the kinetics 
of strain-ageing, the radial velocity of a point defect 
towards the dislocation is 

V = (D/kT)(nA/r n+1 ) (7.16) 

The number of a particular point defect specie that 
reach the dislocation in time t is 

n{t ) = nr 2 Co 

= nco[ADn(n + 2)/kT] 2/(n+2 V /(n+2) (7.17) 

and when n = 2 then n(t) a t l/2 , and when n = 1, 
n(t) a t 2/3 . Since the kinetics of ageing in quenched 
copper follow t l/2 initially, the observations confirm 
the importance of the inhomogeneity interaction for 
vacancies. 

7.4.11 Influence of grain boundaries on 
plasticity 

It might be thought that when a stress is applied 
to a polycrystalline metal, every grain in the sample 


deforms as if it were an unconstrained single crys- 
tal. This is not the case, however, and the fact that 
the aggregate does not deform in this manner is indi- 
cated by the high yield stress of polycrystals compared 
with that of single crystals. This increased strength 
of polycrystals immediately poses the question-is the 
hardness of a grain caused by the presence of the grain 
boundary or by the orientation difference of the neigh- 
bouring grains? It is now believed that the latter is 
the case but that the structure of the grain boundary 
itself may be of importance in special circumstances 
such as when brittle films, due to bismuth in copper 
or cementite in steel, form around the grains or when 
the grains slip past each other along their boundaries 
during high- temperature creep. The importance of the 
orientation change across a grain boundary to the pro- 
cess of slip has been demonstrated by experiments on 
‘bamboo’ -type specimens, i.e. where the grain bound- 
aries are parallel to each other and all perpendicular to 
the axis of tension. Initially, deformation occurs by slip 
only in those grains most favourably oriented, but later 
spreads to all the other grains as those grains which 
are deformed first, work harden. It is then found that 
each grain contains wedge-shaped areas near the grain 
boundary, as shown in Figure 7.31a, where slip does 
not operate, which indicates that the continuance of 
slip from one grain to the next is difficult. From these 
observations it is natural to enquire what happens in a 
completely poly crystalline metal where the slip planes 
must in all cases make contact with a grain bound- 
ary. It will be clear that the poly crystalline aggregate 
must be stronger because, unlike the deformation of 
bamboo-type samples where it is not necessary to raise 
the stress sufficiently high to operate those slip planes 
which made contact with a grain boundary, all the slip 
planes within any grain of a poly crystalline aggregate 
make contact with a grain boundary, but, nevertheless, 
have to be operated. The importance of the grain size 


Slip systems 



Slip band 



Figure 7.31 (a) Grain-boundary blocking of slip, 
(b) Blocking of a slip band by a grain boundary. 
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on a strength is emphasized by Figure 7.30b, which 
shows the variation in lower yield stress, o y , with grain 
diameter, 2d , for low-carbon steel. The smaller the 
grain size, the higher the yield strength according to a 
relation of the form 

a y = cr; + faT 1/2 (7.18) 

where 07 is a lattice friction stress and k a constant 
usually denoted k y to indicate yielding. Because of 
the difficulties experienced by a dislocation in moving 
from one grain to another, the process of slip in 
a polycrystalline aggregate does not spread to each 
grain by forcing a dislocation through the boundary. 
Instead, the slip band which is held up at the boundary 
gives rise to a stress concentration at the head of 
the pile-up group of dislocations which acts with the 
applied stress and is sufficient to trigger off sources 
in neighbouring grains. If r* is the stress a slip band 
could sustain if there were no resistance to slip across 
the grain boundary, i.e. the friction stress, and r the 
higher stress sustained by a slip band in a polycrystal, 
then (r — r,-) represents the resistance offered by the 
boundary, which reaches a limiting value when slip 
is induced in the next grain. The influence of grain 
size can be explained if the length of the slip band 
is proportional to d as shown in Figure 7.31(b). Thus, 
since the stress concentration a short distance r from 
the end of the slip band is proportional to (d/4r) 1/2 , 
the maximum shear stress at a distance r ahead of a 
slip band carrying an applied stress r in a polycrystal 
is given by (r — r*)[d/4r] 1/2 and lies in the plane of 
the slip band. If this maximum stress has to reach a 
value r max to operate a new source at a distance r then 

(r - T,-)[d/4r] 1/2 = r max 
or, rearranging, 

t = T/ + (T max 2 r 1 / 2 )d ~ 1/2 
which may be written as 

r = Ti + k s d ~ l/2 

It then follows that the tensile flow curve of a poly- 
crystal is given by 

a — m(Ti + k s d~ l/2 ) (7.19) 

where m is the orientation factor relating the applied 
tensile stress a to the shear stress, i.e. a = mr. For 
a single crystal the m-factor has a minimum value 
of 2 as discussed, but in polycrystals deformation 
occurs in less favourably oriented grains and some- 
times (e.g. hexagonal, intermetallics, etc.) on ‘hard’ 
systems, and so the m-factor is significantly higher. 
From equation (7.18) it can be seen that 07 = mi/ and 
k = mk s . 

While there is an orientation factor on a macroscopic 
scale in developing the critical shear stress within the 
various grains of a polycrystal, so there is a local ori- 
entation factor in operating a dislocation source ahead 


of a blocked slip band. The slip plane of the sources 
will not, in general, lie in the plane of maximum shear 
stress, and hence r max will need to be such that the 
shear stress, r c required to operate the new source 
must be generated in the slip plane of the source. In 
general, the local orientation factor dealing with the 
orientation relationship of adjacent grains will differ 
from the macroscopic factor of slip plane orientation 
relative to the axis of stress, so that r max = \m r r c . For 
simplicity, however, it will be assumed m' = m and 
hence the parameter k in the Petch equation is given 
by k = m 2 r c r 1/2 . 

It is clear from the above treatment that the parame- 
ter k depends essentially on two main factors. The first 
is the stress to operate a source dislocation, and this 
depends on the extent to which the dislocations are 
anchored or locked by impurity atoms. Strong locking 
implies a large r c and hence a large k; the converse is 
true for weak locking. The second factor is contained 
in the parameter m which depends on the number of 
available slip systems. A multiplicity of slip systems 
enhances the possibility for plastic deformation and so 
implies a small k. A limited number of slip systems 
available would imply a large value of k. It then fol- 
lows, as shown in Figure 7.32, that for (1) fee metals, 
which have weakly locked dislocations and a multi- 
plicity of slip systems, k will generally be small, i.e. 
there is only a small grain size dependence of the flow 
stress, (2) eph metals, k will be large because of the 
limited slip systems, and (3) bcc metals, because of 
the strong locking, k will be large. 

Each grain does not deform as a single crystal in 
simple slip, since, if this were so, different grains 
would then deform in different directions with the 
result that voids would be created at the grain bound- 
aries. Except in high-temperature creep, where the 
grains slide past each other along their boundaries, this 
does not happen and each grain deforms in coherence 
with its neighbouring grains. However, the fact that 



Figure 7.32 Schematic diagram showing the grain 
size-dependence of the yield stress for crystals of different 
crystal structure. 
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the continuity of the metal is maintained during plas- 
tic deformation must mean that each grain is deformed 
into a shape that is dictated by the deformation of its 
neighbours. Such behaviour will, of course, require the 
operation of several slip systems, and von Mises has 
shown that to allow this unrestricted change of shape of 
a grain requires at least five independent shear modes. 
The deformation of metal crystals with cubic structure 
easily satisfies this condition so that the polycrystals 
of these metals usually exhibit considerable ductility, 
and the stress -strain curve generally lies close to that 
of single crystals of extreme orientations deforming 
under multiple slip conditions. The hexagonal metals 
do, however, show striking differences between their 
single crystal and poly crystalline behaviour. This is 
because single crystals of these metals deform by a 
process of basal plane slip, but the three shear sys- 
tems (two independent) which operate do not pro- 
vide enough independent shear mechanisms to allow 
unrestricted changes of shape in polycrystals. Conse- 
quently, to prevent gaps opening up at grain boundaries 
during the deformation of polycrystals, some addi- 
tional shear mechanisms, such as non-basal slip and 
mechanical twinning, must operate. Hence, because 
the resolved stress for non-basal slip and twinning 
is greater than that for basal-plane slip, yielding in a 
polycrystal is prevented until the applied stress is high 
enough to deform by these mechanisms. 

7.4.12 Superplasticity 

A number of materials, particularly two-phase eutec- 
tic or eutectoid alloys, have been observed to exhibit 
large elongations (~1000%) without fracture, and such 
behaviour has been termed superplasticity. Several 
metallurgical factors have been put forward to explain 
superplastic behaviour and it is now generally recog- 
nized that the effect can be produced in materials either 
(1) with a particular structural condition or (2) tested 
under special test conditions. The particular structural 
condition is that the material has a very fine grain size 
and the presence of a two-phase structure is usually of 
importance in maintaining this fine grain size during 
testing. Materials which exhibit superplastic behaviour 
under special test conditions are those for which a 
phase boundary moves through the strained material 
during the test (e.g. during temperature cycling). 

In general, the superplastic material exhibits a high 
strain-rate sensitivity. Thus the plastic flow of a solid 
may be represented by the relation 

cr = Ks m (7.20) 

where o is the stress, s the strain-rate and m an 
exponent generally known as the strain-rate sensitivity. 
When m = 1 the flow stress is directly proportional to 
strain rate and the material behaves as a Newtonian 
viscous fluid, such as hot glass. Superplastic materials 
are therefore characterized by high m-values, since 
this leads to increased stability against necking in 
a tensile test. Thus, for a tensile specimen length / 


with cross-sectional area A under an applied load P 
then dl/l = — d A/ A and introducing the time factor 
we obtain 

s = -(l/A)dA/c \t 

and if during deformation the equation a = Ke m is 
obeyed, then 

AA/At = (7.21) 

For most metals and alloys m ~ 0.1 -0.2 and the rate 
at which A changes is sensitively dependent on A, and 
hence once necking starts the process rapidly leads 
to failure. When m = 1, the rate of change of area is 
independent of A and, as a consequence, any irreg- 
ularities in specimen geometry are not accentuated 
during deformation. The resistance to necking there- 
fore depends sensitively on m, and increases markedly 
when m> 0.5. Considering, in addition, the dependence 
of the flow stress on strain, then 

a = K l s n e m (7.22) 

and in this case, the stability against necking depends 
on a factor (1 — n — m)/m , but n -values are not nor- 
mally very high. Superplastic materials such as Zn-Al 
eutectoid, Pb-Sn eutectic, Al-Cu eutectic, etc. have 
m values approaching unity at elevated temperatures. 

The total elongation increases as m increases and 
with increasing micro structural fineness of the mate- 
rial (grain-size or lamella spacing) the tendency for 
superplastic behaviour is increased. Two-phase struc- 
tures are advantageous in maintaining a fine grain size 
during testing, but exceptionally high ductilities have 
been produced in several commercially pure metals 
(e.g. Ni, Zn and Mg), for which the fine grain size was 
maintained during testing at a particular strain-rate and 
temperature. 

It follows that there must be several possible con- 
ditions leading to superplasticity. Generally, it is 
observed metallographically that the grain structure 
remains remarkably equiaxed during extensive defor- 
mation and that grain boundary sliding is a common 
deformation mode in several superplastic alloys. While 
grain boundary sliding can contribute to the overall 
deformation by relaxing the five independent mecha- 
nisms of slip, it cannot give rise to large elongations 
without bulk flow of material (e.g. grain boundary 
migration). In polycrystals, triple junctions obstruct the 
sliding process and give rise to a low m- value. Thus to 
increase the rate sensitivity of the boundary shear it is 
necessary to lower the resistance to sliding from bar- 
riers, relative to the viscous drag of the boundary; this 
can be achieved by grain boundary migration. Indeed, 
it is observed that superplasticity is controlled by grain 
boundary diffusion. 

The complete explanation of superplasticity is still 
being developed, but it is already clear that during 
deformation individual grains or groups of grains with 
suitably aligned boundaries will tend to slide. Sliding 
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Table 7.2 Twinning elements for some common metals 


Structure 

Plane 

Direction 

Metals 

cph 

{1012} 

(loll) 

Zn, Cd, Be, Mg 

bcc 

{112} 

<111} 

Fe, ^-brass, W, Ta, Nb, V, Cr, Mo 

fee 

{111} 

<112) 

Cu, Ag, Au, Ag-Au, Cu-Al 

Tetragonal 

{331} 

— 

Sn 

Rhombohedral 

{001} 

— 

Bi, As, Sb 


continues until obstructed by a protrusion in a grain 
boundary, when the local stress generates dislocations 
which slip across the blocked grain and pile up at the 
opposite boundary until the back stress prevents further 
generation of dislocations and thus further sliding. At 
the temperature of the test, dislocations at the head of 
the pile-up can climb into and move along grain bound- 
aries to annihilation sites. The continual replacement 
of these dislocations would permit grain boundary slid- 
ing at a rate governed by the rate of dislocation climb, 
which in turn is governed by grain boundary diffusion. 
It is important that any dislocations created by local 
stresses are able to traverse yielded grains and this 
is possible only if the ‘dislocation cell size’ is larger 
than, or at least of the order of, the grain size, i.e. a few 
microns. At high strain-rates and low temperatures the 
dislocations begin to tangle and form cell structures 
and superplasticity then ceases. 

The above conditions imply that any metal in which 
the grain size remains fine during deformation could 
behave superplastic ally; this conclusion is borne out in 
practice. The stability of grain size can, however, be 
achieved more readily with a fine micro-duplex struc- 
ture as observed in some Fe-20Cr-6Ni alloys when 
hot worked to produce a fine dispersion of austen- 
ite and ferrite. Such stainless steels have an attrac- 
tive combination of properties (strength, toughness, 
fatigue strength, weldability and corrosion resistance) 
and unlike the usual range of two-phase stainless steels 
have good hot workability if 0.5Ti is added to pro- 
duce a random distribution of TiC rather than Cr 2 3 C 6 
at ferrite-austenite boundaries. 

Superplastic forming is now an established and 
growing industry largely using vacuum forming to 
produce intricate shapes with high draw ratios. Two 
alloys which have achieved engineering importance 
are Supral (containing Al-6Cu-0.5Zr) and IMI 318 
(containing Ti-6A1-4V). Supral is deformed at 460° C 
and IMI 318 at 900°C under argon. Although the 
process is slow, the loads required are also low and the 
process can be advantageous in the press-forming field 
to replace some of the present expensive and complex 
forming technology. 

7.5 Mechanical twinning 

7.5.1 Crystallography of twinning 

Mechanical twinning plays only a minor part in the 
deformation of the common metals such as copper 


or aluminium, and its study has consequently been 
neglected. Nevertheless, twinning does occur in all the 
common crystal structures under some conditions of 
deformation. Table 7.2 shows the appropriate twinning 
elements for the common structures. 

The geometrical aspects of twinning can be 
represented with the aid of a unit sphere shown in 
Figure 7.33. The twinning plane k\ intersects the plane 
of the drawing in the shear direction Y]\. On twinning 
the unit sphere is distorted to an ellipsoid of equal 
volume, and the shear plane k\ remains unchanged 
during twinning, while all other planes become tilted. 
Distortion of planes occurs in all cases except k\ 
and k 2 . The shear strain, s, at unit distance from 
the twinning plane is related to the angle between 
k\ and k 2 . Thus the amount of shear is fixed by the 
crystallographic nature of the two undistorted planes. 
In the bcc lattice, the two undistorted planes are the 
( 112 ) and ( 112 ) planes, displacement occurring in a 
[111] direction a distance of 0.707 lattice vectors. The 
twinning elements are thus: 


k, 

k 2 

hi 

h 2 

Shear 

( 112 ) 

( 112 ) 

[HI] 

[ill] 

0.707 


where k\ and k 2 denote the first and second undistorted 
planes, respectively, and 371 and 372 denote directions 
lying in k\ and k 2 , respectively, perpendicular to the 
line of intersection of these planes. k\ is also called the 
composition or twinning plane, while 371 is called the 
shear direction. The twins consist of regions of crystal 



Figure 7.33 Crystallography of twinning. 
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in which a particular set of {112} planes (the k\ set 
of planes) is homogeneously sheared by 0.707 in a 
(111) direction (the r)\ direction). The same atomic 
arrangement may be visualized by a shear of 1.414 in 
the reverse (111) direction, but this larger shear has 
never been observed. 

7.5.2 Nucleation and growth of twins 

During the development of mechanical twins, thin 
lamellae appear very quickly (~ speed of sound) and 
these thicken with increasing stress by the steady 
movement of the twin interface. New twins are usually 
formed in bursts and are sometimes accompanied by a 
sharp audible click which coincides with the appear- 
ance of irregularities in the stress -strain curve, as 
shown in Figure 7.34. The rapid production of clicks is 
responsible for the so-called twinning cry (e.g. in tin). 

Although most metals show a general reluctance 
to twin, when tested under suitable conditions they 
can usually be made to do so. As mentioned in 
Section 7.3.1, the shear process involved in twinning 
must occur by the movement of partial dislocations 
and, consequently, the stress to cause twinning will 
depend not only on the line tension of the source dis- 
location, as in the case of slip, but also on the surface 
tension of the twin boundary. The stress to cause twin- 
ning is, therefore, usually greater than that required for 
slip and at room temperature deformation will nearly 
always occur by slip in preference to twinning. As the 


deformation temperature is lowered the critical shear 
stress for slip increases, and then, because the general 
stress level will be high, the process of deformation 
twinning is more likely. 

Twinning is most easily achieved in metals of cph 
structure where, because of the limited number of 
slip systems, twinning is an essential and unavoidable 
mechanism of deformation in poly crystalline speci- 
mens (see Section 7.4.11), but in single crystals the 
orientation of the specimen, the stress level and the 
temperature of deformation are all important factors 
in the twinning process. In metals of the bcc structure 
twinning may be induced by impact at room tempera- 
ture or with more normal strain rates at low tempera- 
ture where the critical shear stress for slip is very high. 
In contrast, only a few fee metals have been made to 
twin, even at low temperatures. 

In zinc single crystals it is observed that there is no 
well-defined critical resolved shear stress for twinning 
such as exists for slip, and that a very high stress 
indeed is necessary to nucleate twins. In most crystals, 
slip usually occurs first and twin nuclei are then created 
by means of the very high stress concentration which 
exists at dislocation pile-ups. Once formed, the twins 
can propagate provided the resolved shear stress is 
higher than a critical value, because the stress to 
propagate a twin is much lower than that to nucleate 
it. This can be demonstrated by deforming a crystal 
oriented in such a way that basal slip is excluded, 


(a) (b) 



Figure 7.34 (a) Effect of grain size on the stress-strain curves of specimens of niobium extended at a rate of 
2 .02 x 10 ~ 4 s~ J at 20 K; (1) grain size 2d = 1 .414 mm, (2) grain size 2d = 0 .312 mm, (3) grain size 2d = 0.0951 mm, 
(4) grain size 2d = 0 .0476 mm. (b) Deformation twins in specimen 1 and specimen 3 extended to fracture. Etched in 95% 
HNOs +5% HF (after Adams, Roberts and Smallman, 1960). 
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i.e. when the basal planes are nearly parallel to the 
specimen axis. Even in such an oriented crystal it is 
found that the stress to cause twinning is higher than 
that for slip on non-basal planes. In this case, non- 
basal slip occurs first so that when a dislocation pile-up 
arises and a twin is formed, the applied stress is so high 
that an avalanche or burst of twins results. 

It is also believed that in the bcc metals twin 
nucleation is more difficult than twin propagation. One 
possible mechanism is that nucleation is brought about 
by the stress concentration at the head of a piled-up 
array of dislocations produced by a burst of slip as 
a Frank- Read source operates. Such a behaviour is 
favoured by impact loading, and it is well known that 
twin lamellae known as Neumann bands are produced 
this way in o'-iron at room temperature. At normal 
strain rates, however, it should be easier to produce 
a slip burst suitable for twin nucleation in a material 
with strongly locked dislocations, i.e. one with a large 
k value (as defined by equation (7.19)) than one in 
which the dislocation locking is relatively slight (small 
k values). In this context it is interesting to note 
that both niobium and tantalum have a small k value 
and, although they can be made to twin, do so with 
reluctance compared, for example, with o'-iron. 

In all the bcc metals the flow stress increases so 
rapidly with decreasing temperature (see Figure 7.30), 
that even with moderate strain rates (ICC 4 s -1 ) o'-iron 
will twin at 77 K, while niobium with its smaller value 
of k twins at 20 K. The type of stress -strain behaviour 
for niobium is shown in Figure 7.34a. The pattern of 
behaviour is characterized by small amounts of slip 
interspersed between extensive bursts of twinning in 
the early stages of deformation. Twins, once formed, 
may themselves act as barriers, allowing further dislo- 
cation pile-up and further twin nucleation. The action 
of twins as barriers to slip dislocations could presum- 
ably account for the rapid work-hardening observed 
at 20 K. 

Fee metals do not readily deform by twinning but 
it can occur at low temperatures, and even at 0°C, in 
favourably oriented crystals. The apparent restriction 
of twinning to certain orientations and low tempera- 
tures may be ascribed to the high shear stress attained 
in tests on crystals with these orientations, since the 
stress necessary to produce twinning is high. Twinning 
has been confirmed in heavily rolled copper. The exact 
mechanism for this twinning is not known, except that 
it must occur by the propagation of a half-dislocation 
and its associated stacking fault across each plane of a 
set of parallel (111) planes. For this process the half- 
dislocation must climb onto successive twin planes, as 
below for bcc iron. 

7.5.3 Effect of impurities on twinning 

It is well established that solid solution alloying 
favours twinning in fee metals. For example, sil- 
ver-gold alloys twin far more readily than the pure 
metals. Attempts have been made to correlate this 


effect with stacking fault energy and it has been 
shown that the twinning stress of copper-based alloys 
increases with increasing stacking fault energy. Twin- 
ning is also favoured by solid solution alloying in 
bcc metals, and alloys of Mo-Re, W-Re and Nb-V 
readily twin at room temperature. In this case it 
has been suggested that the lattice frictional stress 
is increased and the ability to cross-slip reduced by 
alloying, thereby confining slip dislocations to bands 
where stress multiplication conducive to twin nucle- 
ation occurs. 

7.5.4 Effect of prestrain on twinning 

Twinning can be suppressed in most metals by a cer- 
tain amount of prestrain; the ability to twin may be 
restored by an ageing treatment. It has been suggested 
that the effect may be due to the differing dislocation 
distribution produced under different conditions. For 
example, niobium will normally twin at — 196°C, when 
a heterogeneous arrangement of elongated screw dis- 
locations capable of creating the necessary stress con- 
centrations are formed. Room temperature prestrain, 
however, inhibits twin formation as the regular net- 
work of dislocations produced provides more mobile 
dislocations and homogenizes the deformation. 

7.5.5 Dislocation mechanism of twinning 

In contrast to slip, the shear involved in the twinning 
process is homogeneous throughout the entire twinning 
region, and each atom plane parallel to the twinning 
plane moves over the one below it by only a fraction 
of a lattice spacing in the twinning direction. Never- 
theless, mechanical twinning is thought to take place 
by a dislocation mechanism for the same reasons as 
slip but the dislocations that cause twinning are partial 
and not unit dislocations. From the crystallography of 
the process it can be shown that twinning in the eph 
lattice, in addition to a simple shear on the twinning 
plane, must be accompanied by a localized rearrange- 
ment of the atoms, and furthermore, only in the bcc 
lattice does the process of twinning consist of a simple 
shear on the twinning plane (e.g. a twinned structure 
in this lattice can be produced by a shear of 1 /\/2 in 
a (1 1 1) direction on a {1 1 2} plane). 

An examination of Figure 7.9 shows that the main 
problem facing any theory of twinning is to explain 
how twinning develops homogeneously through suc- 
cessive planes of the lattice. This could be accom- 
plished by the movement of a single twinning (par- 
tial) dislocation successively from plane to plane. One 
suggestion, similar in principle to the crystal growth 
mechanism, is the pole-mechanism proposed by Cot- 
trell and Bilby illustrated in Figure 7.35a. Here, OA, 
OB, and OC are dislocation lines. The twinning dis- 
location is OC, which produces the correct shear as it 
sweeps through the twin plane about its point of emer- 
gence O, and OA and OB form the pole dislocation, 
being partly or wholly of screw character with a pitch 
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Figure 7.35 (a) Diagram illustrating the pole mechanism of twinning, (b) The formation of a crack at a twin intersection in 
silicon- iron (after Hull, 1960). 


equal to the spacing of the twinning layers. The twin- 
ning dislocation rotates round the pole dislocation and 
in doing so, not only produces a monolayer sheet of 
twinned crystal but also climbs up the ‘pole’ to the 
next layer. The process is repeated and a thick layer 
of twin is built up. 

The dislocation reaction involved is as follows. The 
line AOB represents a unit dislocation with a Burgers 
vector a/2[l 1 1] and that part OB of the line lies 
in the (112) plane. Then, under the action of stress 
dissociation of this dislocation can occur according to 
the reaction 

a/ 2[1 1 1] — > a/ 3[1 1 2] + a/ 6[1 1 T] 

The dislocation with vector a/6[\ 1 1] forms a line 
OC lying in one of the other {112} twin planes (e.g. 
the (121) plane) and produces the correct twinning 
shear. The line OB is left with a Burgers vector 
<2/3 [1 12] which is of pure edge type and sessile in 
the (1 1 2) plane. 

7.5.6 Twinning and fracture 

It has been suggested that a twin, like a grain boundary, 
may present a strong barrier to slip and that a crack 
can be initiated by the pile-up of slip dislocations at 
the twin interface (see Figure 8.32). In addition, cracks 
may be initiated by the intersection of twins, and 
examples are common in molybdenum, silicon -iron 
(bcc) and zinc (cph). Figure 7.35b shows a very good 
example of crack nucleation in 3% silicon -iron; the 
crack has formed along an {001} cleavage plane at 
the intersection of two {112} twins, and part of the 


crack has developed along one of the twins in a zigzag 
manner while still retaining {001} cleavage facets. 

In tests at low temperature on bcc and cph metals 
both twinning and fracture readily occur, and this 
has led to two conflicting views. First, that twins are 
nucleated by the high stress concentrations associated 
with fracture, and second, that the formation of twins 
actually initiates the fracture. It is probable that both 
effects occur. 

7.6 Strengthening and hardening 

mechanisms 

7.6.1 Point defect hardening 

The introduction of point defects into materials to pro- 
duce an excess concentration of either vacancies or 
interstitials often gives rise to a significant change in 
mechanical properties (Figures 7.36 and 7.37). For alu- 
minium the shape of the stress -strain curve is very 
dependent on the rate of cooling and a large increase 
in the yield stress may occur after quenching. We 
have already seen in Chapter 4 that quenched-in vacan- 
cies result in clustered vacancy defects and these may 
harden the material. Similarly, irradiation by high- 
energy particles may produce irradiation-hardening 
(see Figure 7.37). Information on the mechanisms of 
hardening can be obtained from observation of the 
dependence of the lower yield stress on grain size. 
The results, reproduced in Figure 7.37b, show that the 
relation <r y = o t + k y d ~ l/ 2 , which is a general relation 
describing the propagation of yielding in materials, is 
obeyed. 
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Figure 7.36 Effect of quenching on the stress-strain curves from (a) aluminium (after Maddin and Cottrell, 1955), and 
(b) gold (after Adams and Smallman, unpublished). 



Figure 7.37 (a) Stress- strain curves for unirradiated and irradiated fine-grained polycrystalline copper, tested at 20° C; 
(b) variation of yield stress with grain size and neutron dose (after Adams and Higgins, 1959). 


This dependence of the yield stress, <r y , on grain 
size indicates that the hardening produced by point 
defects introduced by quenching or irradiation, is of 
two types: (1) an initial dislocation source hardening 
and (2) a general lattice hardening which persists after 
the initial yielding. The k y term would seem to indicate 
that the pinning of dislocations may be attributed to 
point defects in the form of coarsely spaced jogs, 
and the electron-microscope observations of jogged 
dislocations would seem to confirm this. 

The lattice friction term o t is clearly responsi- 
ble for the general level of the stress -strain curve 
after yielding and arises from the large density of 
dislocation defects. However, the exact mechanisms 


whereby loops and tetrahedra give rise to an increased 
flow stress is still controversial. Vacancy clusters are 
believed to be formed in situ by the disturbance intro- 
duced by the primary collision, and hence it is not 
surprising that neutron irradiation at 4 K hardens the 
material, and that thermal activation is not essential. 

Unlike dispersion-hardened alloys, the deformation 
of irradiated or quenched metals is characterized by a 
low initial rate of work hardening (see Figure 7.36). 
This has been shown to be due to the sweeping out 
of loops and defect clusters by the glide disloca- 
tions, leading to the formation of cleared channels. 
Diffusion-controlled mechanisms are not thought to be 
important since defect-free channels are produced by 
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deformation at 4 K. The removal of prismatic loops 
both unfaulted and faulted and tetrahedra can occur 
as a result of the strong coalescence interactions with 
screws to form helical configurations and jogged dislo- 
cations when the gliding dislocations and defects make 
contact. Clearly, the sweeping-up process occurs only 
if the helical and jogged configurations can glide eas- 
ily. Resistance to glide will arise from jogs not lying 
in slip planes and also from the formation of sessile 
jogs (e.g. Lomer- Cottrell dislocations in fee crystals). 

7.6.2 Work-hardening 

7.6.2. 1 Theoretical treatment 
The properties of a material are altered by cold- 
working, i.e. deformation at a low temperature rel- 
ative to its melting point, but not all the properties 
are improved, for although the tensile strength, yield 
strength and hardness are increased, the plasticity and 
general ability to deform decreases. Moreover, the 
physical properties such as electrical conductivity, den- 
sity and others are all lowered. Of these many changes 
in properties, perhaps the most outstanding are those 
that occur in the mechanical properties; the yield stress 
of mild steel, for example, may be raised by cold work 
from 170 up to 1050 MN/m 2 . 

Such changes in mechanical properties are, of 
course, of interest theoretically, but they are also 
of great importance in industrial practice. This is 
because the rate at which the material hardens during 
deformation influences both the power required and the 
method of working in the various shaping operations, 
while the magnitude of the hardness introduced 
governs the frequency with which the component must 
be annealed (always an expensive operation) to enable 
further working to be continued. 

Since plastic flow occurs by a dislocation mecha- 
nism the fact that work-hardening occurs means that it 
becomes difficult for dislocations to move as the strain 
increases. All theories of work-hardening depend on 
this assumption, and the basic idea of hardening, put 
forward by Taylor in 1934, is that some dislocations 
become ‘stuck’ inside the crystal and act as sources 
of internal stress which oppose the motion of other 
gliding dislocations. 

One simple way in which two dislocations could 
become stuck is by elastic interaction. Thus, two par- 
allel edge dislocations of opposite sign moving on 
parallel slip planes in any sub-grain may become stuck, 
as a result of the interaction discussed in Chapter 4. 
G. I. Taylor assumed that dislocations become stuck 
after travelling an average distance, L, while the den- 
sity of dislocations reaches p, i.e. work-hardening is 
due to the dislocations getting in each other’s way. 
The flow stress is then the stress necessary to move 
a dislocation in the stress field of those dislocations 
surrounding it. This stress r is quite generally given by 

(7.23) 



bJ p x 10 3 


Figure 7.38 Dependence of flow stress on ( dislocation 
density) 1 ! 2 for Cu, Ag and Cu-Al. 

where pi is the shear modulus, b the Burgers vector, 
/ the mean distance between dislocations which is 
~p _1/2 , and a a constant; in the Taylor model a = 
l/8:/r(l — v). Figure 7.38 shows such a relationship for 
Cu-Al single crystals and poly crystalline Ag and Cu. 

In his theory Taylor considered only a two- 
dimensional model of a cold-worked metal. However, 
because plastic deformation arises from the movement 
of dislocation loops from a source, it is more 
appropriate to assume that when the plastic strain is 
y, N dislocation loops of side L (if we assume for 
convenience that square loops are emitted) have been 
given off per unit volume. The resultant plastic strain 


is then given by 

y = NL 2 b 

(7.24) 

and / by 

/ ~ [l/p 1/2 ] = [1/4LN] 1 ' 2 

(7.25) 

Combining these equations, the stress - 

-strain relation 

r = const. ( b/L) 1/2 y 1/ 2 

(7.26) 


is obtained. Taylor assumed L to be a constant, i.e. 
the slip lines are of constant length, which results in a 
parabolic relationship between r and y. 

Taylor’s assumption that during cold work the den- 
sity of dislocations increases has been amply veri- 
fied, and indeed the parabolic relationship between 
stress and strain is obeyed, to a first approximation, 
in many poly crystalline aggregates where deformation 
in all grains takes place by multiple slip. Experimen- 
tal work on single crystals shows, however, that the 
work- or strain-hardening curve may deviate consider- 
ably from parabolic behaviour, and depends not only 
on crystal structure but also on other variables such as 
crystal orientation, purity and surface conditions (see 
Figures 7.39 and 7.40). 

The crystal structure is important (see Figure 7.39) 
in that single crystals of some hexagonal metals slip 
only on one family of slip planes, those parallel to 
the basal plane, and these metals show a low rate of 


r = aptb/l 
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Figure 7.39 Stress-strain curves of single crystals ( after 
Hirsch and Mitchell, 1967; courtesy of the National 
Research Council of Canada). 



Figure 7.40 Stress-strain curve showing the three stages of 
work hardening. 

work-hardening. The plastic part of the stress -strain 
curve is also more nearly linear than parabolic with 
a slope which is extremely small: this slope (dr/d y) 
becomes even smaller with increasing temperature of 
deformation. Cubic crystals, on the other hand, are 
capable of deforming in a complex manner on more 
than one slip system, and these metals normally show 
a strong work-hardening behaviour. The influence of 
temperature depends on the stress level reached dur- 
ing deformation and on other factors which must be 
considered in greater detail. However, even in cubic 
crystals the rate of work-hardening may be extremely 
small if the crystal is restricted to slip on a single 
slip system. Such behaviour points to the conclusion 
that strong work-hardening is caused by the mutual 
interference of dislocations gliding on intersecting slip 
planes. 

Many theories of work-hardening similar to that of 
Taylor exist but all are oversimplified, since work- 
hardening depends not so much on individual dislo- 
cations as on the group behaviour of large numbers of 
them. It is clear, therefore, that a theoretical treatment 


which would describe the complete stress -strain rela- 
tionship is difficult, and consequently the present-day 
approach is to examine the various stages of hardening 
and then attempt to explain the mechanisms likely to 
give rise to the different stages. The work-hardening 
behaviour in metals with a cubic structure is more 
complex than in most other structures because of the 
variety of slip systems available, and it is for this rea- 
son that much of the experimental evidence is related 
to these metals, particularly those with fee structures. 

7. 6. 2. 2 Three- stage hardening 
The stress -strain curve of a fee single crystal is shown 
in Figure 7.40 and three regions of hardening are 
experimentally distinguishable. Stage I, or the easy 
glide region, immediately follows the yield point and 
is characterized by a low rate of work hardening 0\ 
up to several per cent glide; the length of this region 
depends on orientation, purity and size of the crystals. 
The hardening rate ($i//x) ~ 10 -4 and is of the same 
order as for hexagonal metals. Stage II, or the linear 
hardening region, shows a rapid increase in work- 
hardening rate. The ratio (On/fi) = (dr/d y)//x is of 
the same order of magnitude for all fee metals, i.e. 
1/300 although this is ~ 1/1 50 for orientations at the 
corners of the stereographic triangle. In this stage short 
slip lines are formed during straining quite suddenly, 
i.e. in a short increment of stress Ar, and thereafter do 
not grow either in length or intensity. The mean length 
of the slip lines, L ~ 25 pm decreases with increasing 
strain. Stage III, or the parabolic hardening region, the 
onset of which is markedly dependent on temperature, 
exhibits a low rate of work-hardening, 6m, and the 
appearance of coarse slip bands. This stage sets in at 
a strain which increases with decreasing temperature 
and is probably associated with the annihilation of 
dislocations as a consequence of cross-slip. 

The low stacking fault energy metals exhibit all 
three work-hardening stages at room temperature, but 
metals with a high stacking fault energy often show 
only two stages of hardening. It is found, for example, 
that at 78 K aluminium behaves like copper at room 
temperature and exhibits all three stages, but at room 
temperature and above, stage II is not clearly devel- 
oped and stage III starts before stage II becomes at 
all predominant. This difference between aluminium 
and the noble metals is not due only to the difference 
in melting point but also to the difference in stacking 
fault energies which affects the width of extended dis- 
locations. The main effect of a change of temperature 
of deformation is, however, a change in the onset of 
stage III; the lower the temperature of deformation, the 
higher is the stress T\ \ i corresponding to the onset of 
stage III. 

Because the flow stress of a metal may be affected 
by a change of temperature or strain-rate, it has been 
found convenient to think of the stress as made up of 
two parts according to the relation 

t — r s + Tg 
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where r s is that part of the flow stress which is 
dependent on temperature apart from the variation of 
the elastic modulus /x with temperature, and r g is 
a temperature-independent contribution. The relative 
importance of r s and r g can be studied conveniently 
by measuring the dependence of flow stress on tem- 
perature or strain rate, i.e. the change in flow stress Ar 
on changing the temperature or strain rate, as function 
of deformation. 

The Stage I easy glide region in cubic crystals, with 
its small linear hardening, corresponds closely to the 
hardening of cph crystals where only one glide plane 
operates. It occurs in crystals oriented to allow only 
one glide system to operate, i.e. for orientations near 
the [01 1] pole of the unit triangle (Figure 7.13). In 
this case the slip distance is large, of the order of the 
specimen diameter, with the probability of dislocations 
slipping out of the crystal. Electron microscope obser- 
vations have shown that the slip lines on the surface 
are very long (~1 mm) and closely spaced, and that 
the slip steps are small corresponding to the passage 
of only a few dislocations. This behaviour obviously 
depends on such variables as sample size and oxide 
films, since these influence the probability of disloca- 
tions passing out of the crystal. It is also to be expected 
that the flow stress in easy glide will be governed by 
the ease with which sources begin to operate, since 
there is no slip on a secondary slip system to interfere 
with the movement of primary glide dislocations. 

As soon as another glide system becomes activated 
there is a strong interaction between dislocations on the 
primary and secondary slip systems, which gives rise 
to a steep increase in work-hardening. It is reasonable 
to expect that easy glide should end, and turbulent 
flow begin, when the crystal reaches an orientation for 
which two or more slip systems are equally stressed, 
i.e. for orientations on the symmetry line between 
[0 01] and [1 1 1]. However, easy glide generally ends 
before symmetrical orientations are reached and this is 
principally due to the formation of deformation bands 
to accommodate the rotation of the glide plane in fixed 
grips during tensile tests. This rotation leads to a high 
resolved stress on the secondary slip system, and its 
operation gives rise to those lattice irregularities which 
cause some dislocations to become ‘stopped’ in the 
crystal. The transformation to Stage II then occurs. 

The characteristic feature of deformation in Stage II 
is that slip takes place on both the primary and sec- 
ondary slip systems. As a result, several new lat- 
tice irregularities may be formed which will include 

(1) forest dislocations, (2) Lomer- Cottrell barriers, 
and (3) jogs produced either by moving dislocations 
cutting through forest dislocations or by forest dis- 
locations cutting through source dislocations. Conse- 
quently, the flow stress r may be identified, in general 
terms, with a stress which is sufficient to operate a 
source and then move the dislocations against (1) the 
internal elastic stresses from the forest dislocations, 

(2) the long-range stresses from groups of dislocations 


piled up behind barriers, and (3) the frictional resis- 
tance due to jogs. In a cold-worked metal all these 
factors may exist to some extent, but because a linear 
hardening law can be derived by using any one of the 
various contributory factors, there have been several 
theories of Stage II-hardening, namely (1) the pileup 
theory, (2) the forest theory and (3) the jog theory. 
All have been shown to have limitations in explaining 
various features of the deformation process, and have 
given way to a more phenomenological theory based 
on direct observations of the dislocation distribution 
during straining. 

Observations on fee and bcc crystals have revealed 
several common features of the microstructure which 
include the formation of dipoles, tangles and cell struc- 
tures with increasing strain. The most detailed obser- 
vations have been made for copper crystals, and these 
are summarized below to illustrate the general pattern 
of behaviour. In Stage I, bands of dipoles are formed 
(see Figure 7.41a) elongated normal to the primary 
Burgers vector direction. Their formation is associated 
with isolated forest dislocations and individual dipoles 
are about 1 pm in length and ^10 nm wide. Differ- 
ent patches are arranged at spacings of about 10 pm 
along the line of intersection of a secondary slip plane. 
With increasing strain in Stage I the size of the gaps 
between the dipole clusters decreases and therefore the 
stress required to push dislocations through these gaps 
increases. Stage II begins (see Figure 7.41b) when 
the applied stress plus internal stress resolved on the 
secondary systems is sufficient to activate secondary 
sources near the dipole clusters. The resulting local 
secondary slip leads to local interactions between pri- 
mary and secondary dislocations both in the gaps and 
in the clusters of dipoles, the gaps being filled with 
secondary dislocations and short lengths of other dis- 
locations formed by interactions (e.g. Lomer- Cottrell 
dislocations in fee crystals and a( 100) type dislo- 
cations in bcc crystals). Dislocation barriers are thus 
formed surrounding the original sources. 

In Stage II (see Figure 7.41c) it is proposed that 
dislocations are stopped by elastic interaction when 
they pass too close to an existing tangled region 
with high dislocation density. The long-range internal 
stresses due to the dislocations piling up behind are 
partially relieved by secondary slip, which transforms 
the discrete pile-up into a region of high dislocation 
density containing secondary dislocation networks and 
dipoles. These regions of high dislocation density act 
as new obstacles to dislocation glide, and since every 
new obstacle is formed near one produced at a lower 
strain, two-dimensional dislocation structures are built 
up forming the walls of an irregular cell structure. With 
increasing strain the number of obstacles increases, the 
distance a dislocation glides decreases and therefore 
the slip line length decreases in Stage II. The structure 
remains similar throughout Stage II but is reduced in 
scale. The obstacles are in the form of ribbons of high 
densities of dislocations which, like pile-ups, tend to 
form sheets. The work-hardening rate depends mainly 
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Figure 7.41 Dislocation structure observed in copper single crystals deformed in tension to (a) stage I, (b) end of easy- glide 
and beginning of stage II, (c) top of stage II, and (d) stage III (after Steeds, 1963; Crown copyright; reproduced by permission 
of the Controller, H.M. Stationery Office). 


on the effective radius of the obstacles, and this has 
been considered in detail by Hirsch and co-workers and 
shown to be a constant fraction k of the discrete pile- 
up length on the primary slip system. In general, the 
work-hardening rate is given by On = k/i/ 3n and for 
an fee crystal the small variation in k with orientation 
and alloying element is able to account for the variation 
of 0\ i with those parameters. 


The dislocation arrangement in metals with other 
structures is somewhat similar to that of copper with 
differences arising from stacking fault energy. In 
Cu- A1 alloys the dislocations tend to be confined more 
to the active slip planes, the confinement increasing 
with decreasing y S F- In Stage I dislocation multipoles 
are formed as a result of dislocations of opposite sign 
on parallel nearby slip planes ‘pairing up’ with one 
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another. Most of these dislocations are primaries. In 
Stage II the density of secondary dislocations is much 
less (~|) than that of the primary dislocations. The 
secondary slip occurs in bands and in each band slip 
on one particular secondary plane predominates. In 
niobium, a metal with high y S F, the dislocation dis- 
tribution is rather similar to copper. In Mg, typical of 
cph metals, Stage I is extensive and the dislocations 
are mainly in the form of primary edge multipoles, 
but forest dislocations threading the primary slip plane 
do not appear to be generated. 

From the curve shown in Figure 7.40 it is evident 
that the rate of work-hardening decreases in the later 
stages of the test. This observation indicates that at 
a sufficiently high stress or temperature the disloca- 
tions held up in Stage II are able to move by a process 
which at lower stresses and temperature had been sup- 
pressed. The onset of Stage III is accompanied by 
cross-slip, and the slip lines are broad, deep and con- 
sist of segments joined by cross-slip traces. Electron 
metallographic observations on sections of deformed 
crystal inclined to the slip plane (see Figure 7.4 Id) 
show the formation of a cell structure in the form 
of boundaries, approximately parallel to the primary 
slip plane of spacing about 1-3 pm plus other bound- 
aries extending normal to the slip plane as a result of 
cross-slip. 

The simplest process which is in agreement with 
the experimental observations is that the screw disloca- 
tions held up in Stage II, cross-slip and possibly return 
to the primary slip plane by double cross-slip. By this 
mechanism, dislocations can bypass the obstacles in 
their glide plane and do not have to interact strongly 
with them. Such behaviour leads to an increase in slip 
distance and a decrease in the accompanying rate of 
work-hardening. Furthermore, it is to be expected that 
screw dislocations leaving the glide plane by cross- 
slip may also meet dislocations on parallel planes and 
be attracted by those of opposite sign. Annihilation 
then takes place and the annihilated dislocation will 
be replaced, partly at least, from the original source. 
This process if repeated can lead to slip-band forma- 
tion, which is also an important experimental feature 
of Stage III. Hardening in Stage III is then due to the 
edge parts of the loops which remain in the crystal and 
increase in density as the source continues to operate. 

The importance of the value of the stacking fault 
energy, y, on the stress -strain curve is evident from 
its importance to the process of cross-slip. Low values 
of y give rise to wide stacking fault ‘ribbons’, and 
consequently cross-slip is difficult at reasonable stress 
levels. Thus, the screws cannot escape from their 
slip plane, the slip distance is small, the dislocation 
density is high and the transition from Stage II to 
Stage III is delayed. In aluminium the stacking fault 
ribbon width is very small because y has a high value, 
and cross-slip occurs at room temperature. Stage II is, 
therefore, poorly developed unless testing is carried 
out at low temperatures. These conclusions are in 


agreement with the observations of dislocation density 
and arrangement. 

7. 6. 2. 3 Work-hardening in poly crystals 
The dislocation structure developed during the defor- 
mation of fee and bcc polycrystalline metals follows 
the same general pattern as that in single crystals; 
primary dislocations produce dipoles and loops by 
interaction with secondary dislocations, which give 
rise to local dislocation tangles gradually develop- 
ing into three-dimensional networks of sub-boundaries. 
The cell size decreases with increasing strain, and the 
structural differences that are observed between var- 
ious metals and alloys are mainly in the sharpness 
of the sub-boundaries. In bcc metals, and fee metals 
with high stacking-fault energy, the tangles rearrange 
into sharp boundaries but in metals of low stacking 
fault energy the dislocations are extended, cross-slip is 
restricted, and sharp boundaries are not formed even at 
large strains. Altering the deformation temperature also 
has the effect of changing the dislocation distribution; 
lowering the deformation temperature reduces the ten- 
dency for cell formation, as shown in Figure 7.42. For 
a given dislocation distribution the dislocation density 
is simply related to the flow stress r by an equation of 
the form 

r = r 0 + anbp xjl (7.27) 

where o' is a constant at a given temperature ~0.5; 
t 0 for fee metals is zero (see Figure 7.38). The 
workhardening rate is determined by the ease with 
which tangled dislocations rearrange themselves and 
is high in materials with low y, i.e. brasses, bronzes 
and austenitic steels compared to A1 and bcc met- 
als. In some austenitic steels, work-hardening may 
be increased and better sustained by a strain-induced 
phase transformation (see Chapter 8). 



Figure 7.42 Influence of deformation strain and 
temperature on the formation of a cell structure in a -iron. 
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Grain boundaries affect work-hardening by acting as 
barriers to slip from one grain to the next. In addition, 
the continuity criterion of polycrystals enforces com- 
plex slip in the neighbourhood of the boundaries which 
spreads across the grains with increasing deformation. 
This introduces a dependence of work-hardening rate 
on grain size which extends to several per cent elonga- 
tion. After this stage, however, the work-hardening rate 
is independent of grain size and for fee poly crystals is 
about /x/40, which, allowing for the orientation factors, 
is roughly comparable with that found in single crys- 
tals deforming in multiple slip. Thus from the relations 
o — mr and s = y/m the average resolved shear stress 
on a slip plane is rather less than half the applied tensile 
stress, and the average shear strain parallel to the slip 
plane is rather more than twice the tensile elongation. 
The polycrystal work-hardening rate is thus related to 
the single-crystal work-hardening rate by the relation 

do/d£ = m 2 dr/dy (7.28) 

For bcc metals with the multiplicity of slip systems 
and the ease of cross-slip m is more nearly 2, so that 
the work-hardening rate is low. In poly crystalline eph 
metals the deformation is complicated by twinning, 
but in the absence of twinning m ~ 6.5, and hence the 
work-hardening rate is expected to be more than an 
order of magnitude greater than for single crystals, and 
also higher than the rate observed in fee polycrystals 
for which m ~ 3. 

7. 6. 2.4 Dispersion-hardened alloys 
On deforming an alloy containing incoherent, non- 
deformable particles the rate of work-hardening is 
much greater than that shown by the matrix alone 
(see Figure 8.10). The dislocation density increases 
very rapidly with strain because the particles produce 
a turbulent and complex deformation pattern around 
them. The dislocations gliding in the matrix leave 
loops around particles either by bowing between the 
particles or by cross-slipping around them; both these 
mechanisms are discussed in Chapter 8. The stresses in 
and around particles may also be relieved by activating 
secondary slip systems in the matrix. All these dislo- 
cations spread out from the particle as strain proceeds 
and, by intersecting the primary glide plane, hinder 
primary dislocation motion and lead to intense work- 
hardening. A dense tangle of dislocations is built up 
at the particle and a cell structure is formed with the 
particles predominantly in the cell walls. 

At small strains (<1%) work-hardening probably 
arises from the back-stress exerted by the few Orowan 
loops around the particles, as described by Fisher, Hart 
and Pry. The stress -strain curve is reasonably linear 
with strain s according to 

a = Oi + a/nf 3/2 £ 

with the work-hardening depending only on /, the vol- 
ume fraction of particles. At larger strains the ‘geomet- 
rically necessary’ dislocations stored to accommodate 


the strain gradient which arises because one component 
deforms plastically more than the other, determine the 
work-hardening. A determination of the average den- 
sity of dislocations around the particles with which the 
primary dislocations interact allows an estimate of the 
work-hardening rate, as initially considered by Ashby. 
Thus, for a given strain s and particle diameter d the 
number of loops per particle is 

n ~ sd/b 

and the number of particles per unit volume 

A/y = 3f /4nr 2 , or 6/ /nd 3 

The total number of loops per unit volume is nN w and 
hence the dislocation density p = nN Y nd = 6 fs/db. 
The stress- strain relationship from equation (7.27) 
is then 


o = Oi + a/n(fb/d) l/2 £ l/2 

(7.29) 

and the work-hardening rate 


do/d£ = a' fi(f/d) l/2 (b/£) l/2 

(7.30) 


Alternative models taking account of the detailed struc- 
ture of the dislocation arrays (e.g. Orowan, prismatic 
and secondary loops) have been produced to explain 
some of the finer details of dispersion-hardened mate- 
rials. However, this simple approach provides a useful 
working basis for real materials. Some additional fea- 
tures of dispersion- strengthened alloys are discussed 
in Chapter 8. 

7. 6. 2. 5 Work-hardening in ordered alloys 
A characteristic feature of alloys with long-range order 
is that they work-harden more rapidly than in the 
disordered state. 0 n for Fe-Al with a B2 ordered 
structure is ~/x/50 at room temperature, several times 
greater than a typical fee or bcc metal. However, 
the density of secondary dislocations in Stage II is 
relatively low and only about 1/100 of that of the 
primary dislocations. One mechanism for the increase 
in work-hardening rate is thought to arise from the 
generation of antiphase domain boundary (apb) tubes. 
A possible geometry is shown in Figure 7.43a; the 
superdislocation partials shown each contain a jog 
produced, for example, by intersection with a forest 
dislocation, which are nonaligned along the direction 
of the Burgers vector. When the dislocation glides 
and the jogs move nonconservatively a tube of apbs 
is generated. Direct evidence for the existence of 
tubes from weak-beam electron microscope studies 
was first reported for Fe-30 at.% Al. The micrographs 
show faint lines along (1 1 1), the Burgers vector 
direction, and are about 3 nm in width. The images 
are expected to be weak, since the contrast arises 
from two closely spaced overlapping faults, the second 
effectively cancelling the displacement caused by the 
first, and are visible only when superlattice reflections 
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Figure 7.43 Schematic diagram of superdislocation (a) with non-aligned jogs, which, after glide, produce an apb-tube and 
(b) cross-slipped onto the cube plane to form a Kear-Wilsdorf (K-W) lock. 


are excited. APB tubes have since been observed in 
other compounds. 

Theory suggests that jogs in superdislocations in 
screw orientations provide a potent hardening mecha- 
nism, estimated to be about eight times as strong as that 
resulting from pulling out of apb tubes on non-aligned 
jogs on edge dislocations. The major contributions to 
the stress to move a dislocation are (1) r s , the stress to 
generate point defects or tubes, and (2) the interaction 
stress ti with dislocations on neighbouring slip planes, 
and r s + ti = |o' s /x(pf/p p )£:. Thus, with a s = 1.3 and 
provided Pf/p p is constant and small, linear hardening 
with the observed rate is obtained. 

In crystals with A 3 B order only one rapid stage 
of hardening is observed compared with the normal 
three- stage hardening of fee metals. Moreover, the 
temperature-dependence of On/fi increases with tem- 
perature and peaks at ~0.4T m . It has been argued 
that the apb tube model is unable to explain why 
anomalously high work-hardening rates are observed 
for those single crystal orientations favourable for sin- 
gle slip on {1 1 1} planes alone. An alternative model 
to apb tubes has been proposed based on cross-slip of 
the leading unit dislocation of the superdislocation. If 
the second unit dislocation cannot follow exactly in 
the wake of the first, both will be pinned. 

For alloys with Ll 2 structure the cross-slip of a 
screw superpartial with b = ^ [10 1] from the primary 
(111) plane to the (0 1 0) plane was first proposed by 
Kear and Wilsdorf. The two ^ [10 1] superpartials, one 
on the (1 1 1) plane and the other on the (0 1 0) plane, 
are of course dissociated into (1 1 2) -type partials and 
the whole configuration is sessile. This dislocation 
arrangement is known as a Kear-Wilsdorf (K-W) 
lock and is shown in Figure 7.43b. Since cross-slip 
is thermally activated, the number of locks and there- 
fore the resistance to (111) glide will increase with 


increasing temperature. This could account for the 
increase in yield stress with temperature, while the 
onset of cube slip at elevated temperatures could 
account for the peak in the flow stress. 

Cube cross-slip and cube slip has now been observed 
in a number of Ll 2 compounds by TEM. There is some 
TEM evidence that the apb energy on the cube plane is 
lower than that on the (1 1 1) plane (see Chapter 9) to 
favour cross-slip which would be aided by the torque, 
arising from elastic anisotropy, exerted between the 
components of the screw dislocation pair. 

7.6.3 Development of preferred orientation 

7.6.3. 1 Crystallographic aspects 
When a poly crystalline metal is plastically deformed 
the individual grains tend to rotate into a common 
orientation. This preferred orientation is developed 
gradually with increasing deformation, and although 
it becomes extensive above about 90% reduction in 
area, it is still inferior to that of a good single crystal. 
The degree of texture produced by a given deformation 
is readily shown on a monochromatic X-ray transmis- 
sion photograph, since the grains no longer reflect uni- 
formly into the diffraction rings but only into certain 
segments of them. The results are usually described in 
terms of an ideal orientation, such as [ u , v, w] for the 
fibre texture developed by drawing or swaging, and 
{hkl}(uvw) for a rolling texture for which a plane of 
the form ( hkl ) lies parallel to the rolling plane and a 
direction of the type (uvw) is parallel to the rolling 
direction. However, the scatter about the ideal orienta- 
tion can only be represented by means of a pole-figure 
which describes the spread of orientation about the 
ideal orientation for a particular set of (hkl) poles (see 
Figure 7.44). 
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Rolling direction Rolling 



Figure 7.44 (111) pole figures from (a) copper, and (b) a-brass after 95% deformation ( intensities in arbitrary units). 


In tension, the grains rotate in such a way that the 
movement of the applied stress axis is towards the 
operative slip direction as discussed in Section 7.3.5 
and for compression the applied stress moves towards 
the slip plane normal. By considering the deformation 
process in terms of the particular stresses operating and 
applying the appropriate grain rotations it is possible to 
predict the stable end-grain orientation and hence the 
texture developed by extensive deformation. Table 7.3 
shows the predominant textures found in different 
metal structures for both wires and sheet. 

For fee metals a marked transition in deformation 
texture can be effected either by lowering the defor- 
mation temperature or by adding solid solution alloy- 
ing elements which lower the stacking fault energy. 
The transition relates to the effect on deformation 
modes of reducing stacking fault energy or thermal 
energy, deformation banding and twinning becoming 
more prevalent and cross-slip less important at lower 
temperatures and stacking fault energies. This texture 
transition can be achieved in most fee metals by alloy- 
ing additions and by altering the rolling temperature. 
Al, however, has a high fault energy and because of 
the limited solid solubility it is difficult to lower by 
alloying. The extreme types of rolling texture, shown 
by copper and 70/30 brass, are given in Figures 7.44a 
and 7.44b. 

In bcc metals there are no striking examples of 
solid solution alloying effects on deformation texture, 
the preferred orientation developed being remarkably 
insensitive to material variables. However, material 
variables can affect eph textures markedly. Variations 
in c/a ratio alone cause alterations in the orientation 


developed, as may be appreciated by consideration 
of the twinning modes, and it is also possible that 
solid solution elements alter the relative values of 
critical resolved shear stress for different deformation 
modes. Processing variables are also capable of giving 
a degree of control in hexagonal metals. No texture, 
stable to further deformation, is found in hexagonal 
metals and the angle of inclination of the basal planes 
to the sheet plane varies continuously with deforma- 
tion. In general, the basal plane lies at a small angle 
(<45° ) to the rolling plane, tilted either towards the 
rolling direction (Zn, Mg) or towards the transverse 
direction (Ti, Zr, Be, Hf). 

The deformation texture cannot, in general, be elim- 
inated by an annealing operation even when such a 
treatment causes recrystallization. Instead, the forma- 
tion of a new annealing texture usually results, which 
is related to the deformation texture by standard lattice 
rotations. 

1 .632 Texture-hardening 

The flow stress in single crystals varies with orienta- 
tion according to Schmid’s law and hence materials 
with a preferred orientation will also show similar 
plastic anisotropy, depending on the perfection of the 
texture. The significance of this relationship is well 
illustrated by a crystal of beryllium which is eph and 
capable of slip only on the basal plane, a compres- 
sive stress approaching ~2000 MN/m 2 applied normal 
to the basal plane produces negligible plastic defor- 
mation. Poly crystalline beryllium sheet, with a texture 
such that the basal planes lie in the plane of the sheet, 


Table 7.3 Deformation textures in metals with common crystal structures 


Structure 

Wire (fibre texture) 

Sheet (rolling texture) 

bcc 

[HO] 

{112} (llO) to {100} (Oil) 

fee 

[1 1 1], [100] double fibre 

{110} (112) to {351} (112) 

eph 

[210] 

{0001} (1000) 
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shows a correspondingly high strength in biaxial ten- 
sion. When stretched uniaxially the flow stress is also 
quite high, when additional (prismatic) slip planes are 
forced into action even though the shear stress for their 
operation is five times greater than for basal slip. Dur- 
ing deformation there is little thinning of the sheet, 
because the (1 1 20) directions are aligned in the plane 
of the sheet. Other hexagonal metals, such as tita- 
nium and zirconium, show less marked strengthening 
in uniaxial tension because prismatic slip occurs more 
readily, but resistance to biaxial tension can still be 
achieved. Applications of texture-hardening lie in the 
use of suitably textured sheet for high biaxial strength, 
e.g. pressure vessels, dent resistance, etc. Because of 
the multiplicity of slip systems, cubic metals offer 
much less scope for texture-hardening. Again, a con- 
sideration of single crystal deformation gives the clue; 
for whereas in a hexagonal crystal m can vary from 2 
(basal planes at 45° to the stress axis) to infinity (when 
the basal planes are normal), in an fee crystal m can 
vary only by a factor of 2 with orientation, and in bcc 
crystals the variation is rather less. In extending this 
approach to poly crystalline material certain assump- 
tions have to be made about the mutual constraints 
between grains. One approach gives m = 3.1 for a 
random aggregate of fee crystals and the calculated 
orientation dependence of <r/r for fibre texture shows 
that a rod with (1 1 1) or (1 1 0) texture (p/x = 3.664) 
is 20% stronger than a random structure; the cube tex- 
ture (p/x = 2.449) is 20% weaker. 

If conventional mechanical properties were the sole 
criterion for texture-hardened materials, then it seems 
unlikely that they would challenge strong precipitation- 
hardened alloys. However, texture-hardening has more 
subtle benefits in sheet metal forming in optimizing 
fabrication performance. The variation of strength in 
the plane of the sheet is readily assessed by tensile 
tests carried out in various directions relative to the 
rolling direction. In many sheet applications, however, 
the requirement is for through-thickness strength (e.g. 
to resist thinning during pressing operations). This is 
more difficult to measure and is often assessed from 
uniaxial tensile tests by measuring the ratio of the 
strain in the width direction to that in the thickness 
direction of a test piece. The strain ratio R is given by 

R = £ w /£ t = ln(wo/w)/ln(£ 0 /0 
= ln(w 0 /w)/ln(wL/w 0 L) (7.31) 

where w 0 , L 0 , t 0 are the original dimensions of width, 
length and thickness and w, L and t are the corre- 
sponding dimensions after straining, which is derived 
assuming no change in volume occurs. The average 
strain ratio R , for tests at various angles in the plane of 
the sheet, is a measure of the normal anisotropy, i.e. the 
difference between the average properties in the plane 
of the sheet and that property in the direction normal to 
the sheet surface. A large value of R means that there 
is a lack of deformation modes oriented to provide 



Figure 7.45 Schematic diagram of the deep-drawing 
operations indicating the stress systems operating in the 
flange and the cup wall. Limiting drawing ratio is defined as 
the ratio of the diameter of the largest blank which can 
satisfactorily complete the draw (D max ) to the punch 
diameter (d) (after Dillamore, Smallman and Wilson, 1969; 
courtesy of the Canadian Institute of Mining and 
Metallurgy). 

strain in the through-thickness direction, indicating a 
high through-thickness strength. 

In deep-drawing, schematically illustrated in 
Figure 7.45, the dominant stress system is radial 
tension combined with circumferential compression in 
the drawing zone, while that in the base and lower 
cup wall (i.e. central stretch-forming zone) is biaxial 
tension. The latter stress is equivalent to a through- 
thickness compression, plus a hydrostatic tension 
which does not affect the state of yielding. Drawing 
failure occurs when the central stretch-forming zone 
is insufficiently strong to support the load needed to 
draw the outer part of the blank through the die. 
Clearly differential strength levels in these two regions, 
leading to greater ease of deformation in the drawing 
zone compared with the stretching zone, would enable 
deeper draws to be made: this is the effect of increasing 
the R value, i.e. high through-thickness strength 
relative to strength in the plane of the sheet will favour 
drawability. This is confirmed in Figure 7.46, where 
deep drawability as determined by limiting drawing 
ratio (i.e. ratio of maximum drawable blank diameter 
to final cup diameter) is remarkably insensitive to 
ductility and, by inference from the wide range of 
materials represented in the figure, to absolute strength 
level. Here it is noted that for hexagonal metals slip 
occurs readily along (1 1 20) thus contributing no strain 
in the c-direction, and twinning only occurs on the 
{1012} when the applied stress nearly parallel to 
the c-axis is compressive for c/ a > \/3 and tensile 
for c/a < a/3. Thus titanium, c/a < \/3, has a high 
strength in through-thickness compression, whereas 
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Figure 7.46 Limiting draw ratios (LDR) as a function of 
average values ofR and of elongation to fracture measured 
in tensile tests at 0°, 45° and 90° to the rolling direction 
( after Wilson, 1966; courtesy of the Institute of Metals). 

Zn with c/ a < V3 has low through-thickness strength 
when the basal plane is oriented parallel to the plane of 
the sheet. In contrast, hexagonal metals with c/ a > \/3 
would have a high R for {1 0 10} parallel to the plane 
of the sheet. 

Texture-hardening is much less in the cubic met- 
als, but fee materials with {1 1 1} (110) slip system 
and bcc with {1 1 0} (1 1 1) are expected to increase 
R when the texture has component with {111} and 
{110} parallel to the plane of the sheet. The range of 
values of R encountered in cubic metals is much less. 
Face-centred cubic metals have R ranging from about 
0.3 for cube-texture, {100} (001), to a maximum, 
in textures so far attained, of just over 1.0. Higher 
values are sometimes obtained in body-centred cubic 
metals. Values of R in the range 1.4 ~ 1.8 obtained 
in aluminium-killed low-carbon steel are associated 
with significant improvements in deep-drawing per- 
formance compared with rimming steel, which has 
^-values between 1.0 and 1.4. The highest values of 
R in steels are associated with texture components 
with {111} parallel to the surface, while crystals with 
{10 0} parallel to the surface have a strongly depress- 
ing effect on R. 

In most cases it is found that the R values vary 
with testing direction and this has relevance in rela- 
tion to the strain distribution in sheet metal forming. In 
particular, ear formation on pressings generally devel- 
ops under a predominant uniaxial compressive stress 
at the edge of the pressing. The ear is a direct con- 
sequence of the variation in strain ratio for different 
directions of uniaxial stressing, and a large variation 
in R value, where A R = (R max — R m in) generally cor- 
relates with a tendency to form pronounced ears. On 
this basis we could write a simple recipe for good 


deep-drawing properties in terms of strain ratio mea- 
surements made in a uniaxial tensile test as high R and 
low A R. Much research is aimed at improving forming 
properties through texture control. 

7.7 Macroscopic plasticity 

7.7.1 Tresca and von Mises criteria 

In dislocation theory it is usual to consider the flow 
stress or yield stress of ductile metals under simple 
conditions of stressing. In practice, the engineer deals 
with metals under more complex conditions of stress- 
ing (e.g. during forming operations) and hence needs to 
correlate yielding under combined stresses with that in 
uniaxial testing. To achieve such a yield stress criterion 
it is usually assumed that the metal is mechanically 
isotropic and deforms plastically at constant volume, 
i.e. a hydrostatic state of stress does not affect yield- 
ing. In assuming plastic isotropy, macroscopic shear is 
allowed to take place along lines of maximum shear 
stress and crystallographic slip is ignored, and the yield 
stress in tension is equal to that in compression, i.e. 
there is no Bauschinger effect. 

A given applied stress state in terms of the princi- 
pal stresses <7 1} <r 2 , <r 3 which act along three principal 
axes, Xi,X 2 and X 3 , may be separated into the hydro- 
static part (which produces changes in volume) and 
the deviatoric components (which produce changes in 
shape). It is assumed that the hydrostatic component 
has no effect on yielding and hence the more the stress 
state deviates from pure hydrostatic, the greater the 
tendency to produce yield. The stresses may be rep- 
resented on a stress-space plot (see Figure 7.47a), in 
which a line equidistant from the three stress axes rep- 
resents a pure hydrostatic stress state. Deviation from 
this line will cause yielding if the deviation is suf- 
ficiently large, and define a yield surface which has 
sixfold symmetry about the hydrostatic line. This arises 
because the conditions of isotropy imply equal yield 
stresses along all three axes, and the absence of the 
Bauschinger effect implies equal yield stresses along 
ct i and — cq. Taking a section through stress space, 
perpendicular to the hydrostatic line gives the two 
simplest yield criteria satisfying the symmetry require- 
ments corresponding to a regular hexagon and a circle. 

The hexagonal form represents the Tresca criterion 
(see Figure 7.47c) which assumes that plastic shear 
takes place when the maximum shear stress attains a 
critical value k equal to shear yield stress in uniaxial 
tension. This is expressed by 

Fnax — ^ ^ ( ' *32) 

where the principal stresses o\ > <j 2 > cr 3 . This crite- 
rion is the isotropic equivalent of the law of resolved 
shear stress in single crystals. The tensile yield stress 
Y = 2k is obtained by putting <ti = Y, <j 2 = <j 3 = 0. 
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Figure 7.47 Schematic representation of the yield surface with (a) principal stresses 01 , 02 and o 3 , (b) von Mises yield 
criterion and (c) Tresca yield criterion. 


The circular cylinder is described by the equation 

(<Ti — a 2 ) 2 + (<r 2 — <r 3 ) 2 + (<r 3 — <Ti ) 2 = constant 

(7.33) 

and is the basis of the von Mises yield criterion (see 
Figure 7.47b). This criterion implies that yielding will 
occur when the shear energy per unit volume reaches 
a critical value given by the constant. This constant 
is equal to 6 k 2 or 2T 2 where k is the yield stress in 
simple shear, as shown by putting <r 2 = 0, 0 \ = cr 3 , 
and Y is the yield stress in uniaxial tension when 
a 2 = <73 = 0. Clearly Y = 3k compared to Y = 2k for 
the Tresca criterion and, in general, this is found to 
agree somewhat closer with experiment. 

In many practical working processes (e.g. rolling), 
the deformation occurs under approximately plane 
strain conditions with displacements confined to the 
XiX 2 plane. It does not follow that the stress in this 
direction is zero, and, in fact, the deformation condi- 
tions are satisfied if <j 3 = l,{o\ + <r 2 ) so that the ten- 
dency for one pair of principal stresses to extend the 
metal along the X 3 axis is balanced by that of the other 
pair to contract it along this axis. Eliminating <r 3 from 
the von Mises criterion, the yield criterion becomes 



Figure 7.48 The von Mises yield ellipse and Tresca yield 
hexagon. 

inscribed in the ellipse as shown in Figure 7.48. Thus, 
when <7i and o 2 have opposite signs, the Tresca crite- 
rion becomes o\ — o 2 = 2k — Y and is represented by 
the edges of the hexagon CD and FA. When they have 
the same sign then o\ = 2k = Y or o 2 = 2k = Y and 
defines the hexagon edges AB, BC, DE and EF. 

7.7.2 Effective stress and strain 


(at - o 2 ) = 2k 

and the plane strain yield stress, i.e. when <r 2 = 0, 
given when 

o'! =2k = 2Y/V3= 1.157 

For plane strain conditions, the Tresca and von 
Mises criteria are equivalent and two-dimensional flow 
occurs when the shear stress reaches a critical value. 
The above condition is thus equally valid when written 
in terms of the deviatoric stresses o [ , o' 2 , a' 3 defined 
by equations of the type cr[ = o\ — |(<Ti + o 2 + <x 3 ). 

Under plane stress conditions, <r 3 = 0 and the yield 
surface becomes two-dimensional and the von Mises 
criterion becomes 

o\ + < 7 ! <72 + < 7 2 2 = 3 k 2 = Y 2 ( 7 . 34 ) 

which describes an ellipse in the stress plane. For the 
Tresca criterion the yield surface reduces to a hexagon 


For an isotropic material, a knowledge of the uniaxial 
tensile test behaviour together with the yield func- 
tion should enable the stress -strain behaviour to be 
predicted for any stress system. This is achieved by 
defining an effective stress -effective strain relation- 
ship such that if <r = Ks n is the uniaxial stress -strain 
relationship then we may write. 

a = Ks n (7.35) 

for any state of stress. The stress -strain behaviour of 
a thin-walled tube with internal pressure is a typi- 
cal example, and it is observed that the flow curves 
obtained in uniaxial tension and in biaxial torsion 
coincide when the curves are plotted in terms of effec- 
tive stress and effective strain. These quantities are 
defined by: 

a/2 

a = — [(<7i - <r 2 ) 2 + {a 2 - a 3 ) 2 + (<r 3 - <7i) 2 ] 1/2 

(7.36) 
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and 

s = — [(£1 - s 2 f + (e 2 - £ 3) 2 + (£3 - £i ) 2 ] 1/2 

(7.37) 

where £1, £ 2 and £3 are the principal strains, both 
of which reduce to the axial normal components of 
stress and strain for a tensile test. It should be empha- 
sized, however, that this generalization holds only for 
isotropic media and for constant loading paths, i.e. 
<7 1 = a<j 2 = P&3 where a and /3 are constants inde- 
pendent of the value of o\. 

7.8 Annealing 

7.8.1 General effects of annealing 

When a metal is cold-worked, by any of the many 
industrial shaping operations, changes occur in both 
its physical and mechanical properties. While the 
increased hardness and strength which result from the 
working treatment may be of importance in certain 
applications, it is frequently necessary to return the 
metal to its original condition to allow further forming 
operations (e.g. deep drawing) to be carried out of for 
applications where optimum physical properties, such 
as electrical conductivity, are essential. The treatment 
given to the metal to bring about a decrease of the 
hardness and an increase in the ductility is known as 
annealing. This usually means keeping the deformed 
metal for a certain time at a temperature higher than 
about one-third the absolute melting point. 

Cold working produces an increase in dislocation 
density; for most metals p increases from the value 
of 10 10 -10 12 lines m~ 2 typical of the annealed state, 
to 10 12 -10 13 after a few per cent deformation, and 
up to 10 15 — 10 16 lines m~ 2 in the heavily deformed 
state. Such an array of dislocations gives rise to a 
substantial strain energy stored in the lattice, so that the 
cold-worked condition is thermodynamically unstable 
relative to the undeformed one. Consequently, the 
deformed metal will try to return to a state of lower 
free energy, i.e. a more perfect state. In general, this 
return to a more equilibrium structure cannot occur 
spontaneously but only at elevated temperatures where 
thermally activated processes such as diffusion, cross- 
slip and climb take place. Like all non-equilibrium 
processes the rate of approach to equilibrium will be 
governed by an Arrhenius equation of the form 

Rate = A exp [—Q/k T] 

where the activation energy Q depends on impurity 
content, strain, etc. 

The formation of atmospheres by strain-ageing is 
one method whereby the metal reduces its excess 
lattice energy but this process is unique in that it 
usually leads to a further increase in the structure- 
sensitive properties rather than a reduction to the value 
characteristic of the annealed condition. It is necessary, 


therefore, to increase the temperature of the deformed 
metal above the strain-ageing temperature before it 
recovers its original softness and other properties. 

The removal of the cold-worked condition occurs 
by a combination of three processes, namely: 
( 1 ) recovery, ( 2 ) recrystallization and ( 3 ) grain growth. 
These stages have been successfully studied using 
light microscopy, transmission electron microscopy, or 
X-ray diffraction; mechanical property measurements 
(e.g. hardness); and physical property measurements 
(e.g. density, electrical resistivity and stored energy). 
Figure 7.49 shows the change in some of these prop- 
erties on annealing. During the recovery stage the 
decrease in stored energy and electrical resistivity 
is accompanied by only a slight lowering of hard- 
ness, and the greatest simultaneous change in proper- 
ties occurs during the primary recrystallization stage. 
However, while these measurements are no doubt 
striking and extremely useful, it is necessary to under- 
stand them to correlate such studies with the structural 
changes by which they are accompanied. 

7.8.2 Recovery 

This process describes the changes in the distribution 
and density of defects with associated changes in phys- 
ical and mechanical properties which take place in 
worked crystals before recrystallization or alteration 
of orientation occurs. It will be remembered that the 
structure of a cold- worked metal consists of dense dis- 
location networks, formed by the glide and interaction 
of dislocations, and, consequently, the recovery stage 
of annealing is chiefly concerned with the rearrange- 
ment of these dislocations to reduce the lattice energy 
and does not involve the migration of large-angle 
boundaries. This rearrangement of the dislocations is 



Figure 7.49 (a) Rate of release of stored energy (A P), 
increment in electrical resistivity (A p) and hardness (VPN) 
for specimens of nickel deformed in torsion and heated at 6 
k/min (Clareborough, Hargreaves and West, 1955). 
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assisted by thermal activation. Mutual annihilation of 
dislocations is one process. 

When the two dislocations are on the same slip 
plane, it is possible that as they run together and 
annihilate they will have to cut through intersecting 
dislocations on other planes, i.e. ‘forest’ dislocations. 
This recovery process will, therefore, be aided by ther- 
mal fluctuations since the activation energy for such a 
cutting process is small. When the two dislocations of 
opposite sign are not on the same slip plane, climb or 
cross-slip must first occur, and both processes require 
thermal activation. 

One of the most important recovery processes which 
leads to a resultant lowering of the lattice strain 
energy is rearrangement of the dislocations into cell 
walls. This process in its simplest form was originally 
termed polygonization and is illustrated schematically 
in Figure 7.50, whereby dislocations all of one sign 
align themselves into walls to form small-angle or sub- 
grain boundaries. During deformation a region of the 
lattice is curved, as shown in Figure 7.50a, and the 
observed curvature can be attributed to the formation 
of excess edge dislocations parallel to the axis of bend- 
ing. On heating, the dislocations form a sub-boundary 
by a process of annihilation and rearrangement. This is 
shown in Figure 7.50b, from which it can be seen that 
it is the excess dislocations of one sign which remain 
after the annihilation process that align themselves into 
walls. 

Polygonization is a simple form of sub-boundary 
formation and the basic movement is climb whereby 
the edge dislocations change their arrangement from a 
horizontal to a vertical grouping. This process involves 
the migration of vacancies to or from the edge of the 
half-planes of the dislocations (see Section 4.3.4). The 


removal of vacancies from the lattice, together with the 
reduced strain energy of dislocations which results, can 
account for the large change in both electrical resis- 
tivity and stored energy observed during this stage, 
while the change in hardness can be attributed to the 
rearrangement of dislocations and to the reduction in 
the density of dislocations. 

The process of polygonization can be demonstrated 
using the Laue method of X-ray diffraction. Diffrac- 
tion from a bent single crystal of zinc takes the form 
of continuous radial streaks. On annealing, these aster- 
isms (see Figure 5.10) break up into spots as shown 
in Figure 7.50c, where each diffraction spot originates 
from a perfect polygonized sub-grain, and the distance 
between the spots represents the angular misorienta- 
tion across the sub-grain boundary. Direct evidence 
for this process is observed in the electron microscope, 
where, in heavily deformed polycrystalline aggregates 
at least, recovery is associated with the formation of 
sub-grains out of complex dislocation networks by 
a process of dislocation annihilation and rearrange- 
ment. In some deformed metals and alloys the disloca- 
tions are already partially arranged in sub-boundaries 
forming diffuse cell structures by dynamical recovery 
(see Figure 7.41). The conventional recovery process 
is then one in which these cells sharpen and grow. 
In other metals, dislocations are more uniformly dis- 
tributed after deformation, with hardly any cell struc- 
ture discernible, and the recovery process then involves 
formation, sharpening and growth of sub-boundaries. 
The sharpness of the cell structure formed by defor- 
mation depends on the stacking fault energy of the 
metal, the deformation temperature and the extent of 
deformation (see Figure 7.42). 



Figure 7.50 (a) Random arrangement of excess parallel edge dislocations and (b) alignment into dislocation walls; (c) Laue 
photograph of polygonized zinc ( after Cahn, 1949). 
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7.8.3 Recrystallization 

The most significant changes in the structure- sensitive 
properties occur during the primary recrystallization 
stage. In this stage the deformed lattice is completely 
replaced by a new unstrained one by means of a nucle- 
ation and growth process, in which practically stress- 
free grains grow from nuclei formed in the deformed 
matrix. The orientation of the new grains differs con- 
siderably from that of the crystals they consume, so 
that the growth process must be regarded as incoherent, 
i.e. it takes place by the advance of large-angle bound- 
aries separating the new crystals from the strained 
matrix. 

During the growth of grains, atoms get transferred 
from one grain to another across the boundary. Such a 
process is thermally activated as shown in Figure 7.51, 
and by the usual reaction-rate theory the frequency of 
atomic transfer one way is 

vexp 5-1 (738) 


and in the reverse direction 


vexp 


/ AF* + AF\ _ 1 
{ kT ) s 


(7.39) 


where AF is the difference in free energy per atom 
between the two grains, i.e. supplying the driving 
force for migration, and AF* is an activation energy. 
For each net transfer the boundary moves forward a 
distance b and the velocity v is given by 

v = MAF (7.40) 


where M is the mobility of the boundary, i.e. the 
velocity for unit driving force, and is thus 


bv 

M = — exp 
kT F 




(7.41) 


Generally, the open structure of high-angle bound- 
aries should lead to a high mobility. However they are 



susceptible to the segregation of impurities, low con- 
centrations of which can reduce the boundary mobility 
by orders of magnitude. In contrast, special bound- 
aries which are close to a CSL are much less affected 
by impurity segregation and hence can lead to higher 
relative mobility. 

It is well known that the rate of recrystallization 
depends on several important factors, namely: (1) the 
amount of prior deformation (the greater the degree 
of cold work, the lower the recrystallization tempera- 
ture and the smaller the grain size), (2) the tempera- 
ture of the anneal (as the temperature is lowered the 
time to attain a constant grain size increases exponen- 
tially 1 ) and (3) the purity of the sample (e.g. zone- 
refined aluminium recrystallizes below room tempera- 
ture, whereas aluminium of commercial purity must be 
heated several hundred degrees). The role these vari- 
ables play in recrystallization will be evident once the 
mechanism of recrystallization is known. This mecha- 
nism will now be outlined. 

Measurements, using the light microscope, of the 
increase in diameter of a new grain as a function of 
time at any given temperature can be expressed as 
shown in Figure 7.52. The diameter increases linearly 
with time until the growing grains begin to impinge on 
one another, after which the rate necessarily decreases. 
The classical interpretation of these observations is 
that nuclei form spontaneously in the matrix after a 
so-called nucleation time, t 0 , and these nuclei then 
proceed to grow steadily as shown by the linear rela- 
tionship. The driving force for the process is provided 
by the stored energy of cold work contained in the 
strained grain on one side of the boundary relative to 
that on the other side. Such an interpretation would 
suggest that the recrystallization process occurs in two 
distinct stages, i.e. first nucleation and then growth. 

During the linear growth period the radius of a 
nucleus is R = G(t — to), where G, the growth rate, is 



Figure 7.52 Variation of grain diameter with time at a 
constant temperature. 

'The velocity of linear growth of new crystals usually 
obeys an exponential relationship of the form 
v = v 0 exp [-Q/RT]. 
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dR/dt and, assuming the nucleus is spherical, the vol- 
ume of the recrystallized nucleus is An/3G 3 {t — to) 3 . 
If the number of nuclei that form in a time increment 
d/' is df per unit volume of unrecry stallized matrix, 
and if the nuclei do not impinge on one another, then 
for unit total volume 

/ = \nNG 3 [ (t- t 0 fdt 
d Jo 

or 

/ = |gV (7.42) 

This equation is valid in the initial stages when 
/ 1. When the nuclei impinge on one another the 

rate of recrystallization decreases and is related to the 
amount untransformed (1 — /) by 

/ = l-exp (-|tVGY) (7.43) 

where, for short times, equation (7.43) reduces to 

equation (7.42). This Johnson -Mehl equation is 
expected to apply to any phase transformation where 
there is random nucleation, constant N and G and small 
to. In practice, nucleation is not random and the rate 
not constant so that equation (7.43) will not strictly 
apply. For the case where the nucleation rate decreases 
exponentially, Avrami developed the equation 

/ = 1 - exp (-fa") (7.44) 

where k and n are constants, with n ~ 3 for a fast and 
n ~ 4 for a slow, decrease of nucleation rate. Provided 
there is no change in the nucleation mechanism, n 
is independent of temperature but k is very sensitive 
to temperature T ; clearly from equation (7.43), k = 
ttNG 3 /3 and both N and G depend on T. 

An alternative interpretation is that the so-called 
incubation time to represents a period during which 
small nuclei, of a size too small to be observed in the 
light microscope, are growing very slowly. This lat- 
ter interpretation follows from the recovery stage of 
annealing. Thus, the structure of a recovered metal 
consists of sub-grain regions of practically perfect 
crystal and, thus, one might expect the ‘active’ recrys- 
tallization nuclei to be formed by the growth of certain 
sub-grains at the expense of others. 

The process of recrystallization may be pictured as 
follows. After deformation, polygonization of the bent 
lattice regions on a fine scale occurs and this results in 
the formation of several regions in the lattice where the 
strain energy is lower than in the surrounding matrix; 
this is a necessary primary condition for nucleation. 
During this initial period when the angles between 
the sub-grains are small and less than one degree, 
the sub-grains form and grow quite rapidly. However, 
as the sub-grains grow to such a size that the angles 
between them become of the order of a few degrees, 
the growth of any given sub-grain at the expense of the 
others is very slow. Eventually one of the sub-grains 


will grow to such a size that the boundary mobility 
begins to increase with increasing angle. A large angle 
boundary, 0 ~ 30-40°, has a high mobility because 
of the large lattice irregularities or ‘gaps’ which exist 
in the boundary transition layer. The atoms on such 
a boundary can easily transfer their allegiance from 
one crystal to the other. This sub-grain is then able 
to grow at a much faster rate than the other sub- 
grains which surround it and so acts as the nucleus 
of a recrystallized grain. The further it grows, the 
greater will be the difference in orientation between 
the nucleus and the matrix it meets and consumes, until 
it finally becomes recognizable as a new strain-free 
crystal separated from its surroundings by a large-angle 
boundary. 

The recrystallization nucleus therefore has its origin 
as a sub-grain in the deformed micro structure. Whether 
it grows to become a strain-free grain depends on three 
factors: (1) the stored energy of cold work must be 
sufficiently high to provide the required driving force, 
(2) the potential nucleus should have a size advantage 
over its neighbours, and (3) it must be capable of con- 
tinued growth by existing in a region of high lattice 
curvature (e.g. transition band) so that the growing 
nucleus can quickly achieve a high-angle boundary. In 
situ experiments in the HVEM have confirmed these 
factors. Figure 7.53a shows the as-deformed substruc- 
ture in the transverse section of rolled copper, together 
with the orientations of some selected areas. The sub- 
grains are observed to vary in width from 50 to 500 
nm, and exist between regions 1 and 8 as a transition 
band across which the orientation changes sharply. On 
heating to 200° C, the sub-grain region 2 grows into the 
transition region (Figure 7.53b) and the orientation of 
the new grain well developed at 300° C is identical to 
the original sub-grain (Figure 7.53c). 

With this knowledge of recrystallization the influ- 
ence of several variables known to affect the recrys- 
tallization behaviour of a metal can now be under- 
stood. Prior deformation, for example, will control the 
extent to which a region of the lattice is curved. The 
larger the deformation, the more severely will the lat- 
tice be curved and, consequently, the smaller will be 
the size of a growing sub-grain when it acquires a 
large-angle boundary. This must mean that a shorter 
time is necessary at any given temperature for the sub- 
grain to become an ‘active’ nucleus, or conversely, 
that the higher the annealing temperature, the quicker 
will this stage be reached. In some instances, heavily 
cold-worked metals recrystallize without any signif- 
icant recovery owing to the formation of strain-free 
cells during deformation. The importance of impurity 
content on recrystallization temperature is also evident 
from the effect impurities have on obstructing disloca- 
tion sub-boundary and grain boundary mobility. 

The intragranular nucleation of strain-free grains, 
as discussed above, is considered as abnormal sub- 
grain growth, in which it is necessary to specify 
that some sub-grains acquire a size advantage and 
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1 . (1J1)[1 10] 6. (1 12)[1 10] 

2. (1J0)[1 12] 7. (1 11)[1 10] 

3. (1 1_0)[1 12] 8. (1 11)[1 10] 

4. (112)[111] 9. (4J35)[1 1 1] 

5. (1 51 )[1 23] 10. (1 10)[1 12] 



Figure 7.53 Electron micrographs of copper, (a) Cold-rolled 95% at room temperature, transverse section, (b) heated to 
300 0 C in the HVEM. 


are able to grow at the expense of the normal sub- 
grains. It has been suggested that nuclei may also 
be formed by a process involving the rotation of 
individual cells so that they coalesce with neighbouring 
cells to produce larger cells by volume diffusion and 
dislocation rearrangement. 

In some circumstances, intergranular nucleation is 
observed in which an existing grain boundary bows 
out under an initial driving force equal to the difference 
in free energy across the grain boundary. This strain- 
induced boundary migration is irregular and is from 
a grain with low strain (i.e. large cell size) to one 
of larger strain and smaller cell size. For a boundary 
to grow in this way the strain energy difference per 
unit volume across the boundary must be sufficient 
to supply the energy increase to bow out a length of 
boundary ~1 pm. 


Segregation of solute atoms to, and precipitation 
on, the grain boundary tends to inhibit intergranu- 
lar nucleation and gives an advantage to intragran- 
ular nucleation, provided the dispersion is not too 
fine. In general, the recrystallization behaviour of two- 
phase alloys is extremely sensitive to the dispersion 
of the second phase. Small, finely dispersed particles 
retard recrystallization by reducing both the nucleation 
rate and the grain boundary mobility, whereas large 
coarsely dispersed particles enhance recrystallization 
by increasing the nucleation rate. During deformation, 
zones of high dislocation density and large misorien- 
tations are formed around non-deformable particles, 
and on annealing, recrystallization nuclei are created 
within these zones by a process of polygonization by 
sub-boundary migration. Particle- stimulated nucleation 
occurs above a critical particle size which decreases 
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with increasing deformation. The finer dispersions tend 
to homogenize the microstructure (i.e. dislocation dis- 
tribution) thereby minimizing local lattice curvature 
and reducing nucleation. 

The formation of nuclei becomes very difficult when 
the spacing of second-phase particles is so small that 
each developing sub-grain interacts with a particle 
before it becomes a viable nucleus. The extreme case 
of this is SAP (sintered aluminium powder) which con- 
tains very stable, close-spaced oxide particles. These 
particles prevent the rearrangement of dislocations into 
cell walls and their movement to form high-angle 
boundaries, and hence SAP must be heated to a tem- 
perature very close to the melting point before it 
recrystallizes. 

7.8.4 Grain growth 

When primary recrystallization is complete (i.e. when 
the growing crystals have consumed all the strained 
material) the material can lower its energy further by 
reducing its total area of grain surface. With exten- 
sive annealing it is often found that grain boundaries 
straighten, small grains shrink and larger ones grow. 
The general phenomenon is known as grain growth, 
and the most important factor governing the process is 
the surface tension of the grain boundaries. A grain 
boundary has a surface tension, T (= surface-free 
energy per unit area) because its atoms have a higher 
free energy than those within the grains. Consequently, 
to reduce this energy a poly crystal will tend to min- 
imize the area of its grain boundaries and when this 
occurs the configuration taken up by any set of grain 
boundaries (see Figure 7.54) will be governed by the 
condition that 

T a / sin A = 7V sin 5 = T c / sin C (7.45) 

Most grain boundaries are of the large-angle type 
with their energies approximately independent of ori- 
entation, so that for a random aggregate of grains 
T a = T b = T c and the equilibrium grain boundary 
angles are each equal to 120°. Figure 7.54b shows an 
idealized grain in two dimensions surrounded by others 




Figure 7.54 (a) Relation between angles and surface 
tensions at a grain boundary triple point; (b) idealized 
polygonal grain structure. 


of uniform size, and it can be seen that the equilibrium 
grain shape takes the form of a polygon of six sides 
with 120° inclusive angles. All polygons with either 
more or less than this number of sides cannot be in 
equilibrium. At high temperatures where the atoms are 
mobile, a grain with fewer sides will tend to become 
smaller, under the action of the grain boundary surface 
tension forces, while one with more sides will tend to 
grow. 

Second-phase particles have a major inhibiting 
effect on boundary migration and are particularly 
effective in the control of grain size. The pinning 
process arises from surface tension forces exerted by 
the particle -matrix interface on the grain boundary as 
it migrates past the particle. Figure 7.55 shows that the 
drag exerted by the particle on the boundary, resolved 
in the forward direction, is 

F = nry sin 20 

where y is the specific interfacial energy of the 
boundary; F = F max = nry when 6 = 45°. Now if 
there are N particles per unit volume, the volume 
fraction is 4nr 3 N /3 and the number n intersecting unit 
area of boundary is given by 

n = 3f/2jrr 2 (7.46) 

For a grain boundary migrating under the influence 
of its own surface tension the driving force is 2 y/R, 
where R is the minimum radius of curvature and as 
the grains grow, R increases and the driving force 
decreases until it is balanced by the particle-drag, when 
growth stops. If R ~ d the mean grain diameter, then 
the critical grain diameter is given by the condition 

nF « 2y/d cAt 
or 

d a \x « 2y(2nr 2 /3f7iry) = 4r/3f (7.47) 

This Zener drag equation overestimates the driving 
force for grain growth by considering an isolated 



Figure 7.55 Diagram showing the drag exerted on a 
boundary by a particle. 
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spherical grain. A heterogeneity in grain size is 
necessary for grain growth and taking this into account 
gives a revised equation: 


nr " 3 2 " 

3 J[2~Z_ 


(7.48) 


where Z is the ratio of the diameters of growing grains 
to the surrounding grains. This treatment explains the 
successful use of small particles in refining the grain 
size of commercial alloys. 

During the above process growth is continuous and 
a uniform coarsening of the poly crystalline aggregate 
usually occurs. Nevertheless, even after growth has 
finished the grain size in a specimen which was 
previously severely cold-worked remains relatively 
small, because of the large number of nuclei produced 
by the working treatment. Exaggerated grain growth 
can often be induced, however, in one of two 
ways, namely: (1) by subjecting the specimen to a 
critical strain-anneal treatment or (2) by a process 
of secondary recrystallization. By applying a critical 
deformation (usually a few per cent strain) to the 
specimen the number of nuclei will be kept to 
a minimum, and if this strain is followed by a 
high-temperature anneal in a thermal gradient some 
of these nuclei will be made more favourable for 
rapid growth than others. With this technique, if the 
conditions are carefully controlled, the whole of the 
specimen may be turned into one crystal, i.e. a single 
crystal. The term secondary recrystallization describes 
the process whereby a specimen which has been 
given a primary recrystallization treatment at a low 
temperature is taken to a higher temperature to enable 
the abnormally rapid growth of a few grains to occur. 
The only driving force for secondary recrystallization 
is the reduction of grain boundary-free energy, as 
in normal grain growth, and consequently, certain 
special conditions are necessary for its occurrence. 
One condition for this ‘abnormal’ growth is that 
normal continuous growth is impeded by the presence 
of inclusions, as is indicated by the exaggerated 
grain growth of tungsten wire containing thoria, or 
the sudden coarsening of deoxidized steel at about 
1000°C. A possible explanation for the phenomenon 
is that in some regions the grain boundaries become 
free (e.g. if the inclusions slowly dissolve or the 
boundary tears away) and as a result the grain size 
in such regions becomes appreciably larger than the 
average (Figure 7.56a). It then follows that the grain 




Figure 7.56 Grain growth during secondary 
recrystallization. 


boundary junction angles between the large grain 
and the small ones that surround it will not satisfy 
the condition of equilibrium discussed above. As a 
consequence, further grain boundary movement to 
achieve 120° angles will occur, and the accompanying 
movement of a triple junction point will be as shown 
in Figure 7.56b. However, when the dihedral angles 
at each junction are approximately 120° a severe 
curvature in the grain boundary segments between 
the junctions will arise, and this leads to an increase 
in grain boundary area. Movement of these curved 
boundary segments towards their centres of curvature 
must then take place and this will give rise to the 
configuration shown in Figure 7.56c. Clearly, this 
sequence of events can be repeated and continued 
growth of the large grains will result. 

The behaviour of the dispersed phase is extremely 
important in secondary recrystallization and there are 
many examples in metallurgical practice where the 
control of secondary recrystallization with dispersed 
particles has been used to advantage. One example 
is in the use of Fe-3% Si in the production of strip 
for transformer laminations. This material is required 
with (1 1 0) [00 1] ‘Goss’ texture because of the [00 1] 
easy direction of magnetization, and it is found that 
the presence of MnS particles favours the growth of 
secondary grains with the appropriate Goss texture. 
Another example is in the removal of the pores during 
the sintering of metal and ceramic powders, such as 
alumina and metallic carbides. The sintering process 
is essentially one of vacancy creep involving the 
diffusion of vacancies from the pore of radius r to 
a neighbouring grain boundary, under a driving force 
2 y s /r where y s is the surface energy. In practice, 
sintering occurs fairly rapidly up to about 95% full 
density because there is a plentiful association of 
boundaries and pores. When the pores become very 
small, however, they are no longer able to anchor 
the grain boundaries against the grain growth forces, 
and hence the pores sinter very slowly, since they 
are stranded within the grains some distance from 
any boundary. To promote total sintering, an effective 
dispersion is added. The dispersion is critical, however, 
since it must produce sufficient drag to slow down 
grain growth, during which a particular pore is crossed 
by several migrating boundaries, but not sufficiently 
large to give rise to secondary recrystallization when a 
given pore would be stranded far from any boundary. 

The relation between grain- size, temperature and 
strain is shown in Figure 7.57 for commercially 
pure aluminium. From this diagram it is clear 
that either a critical strain-anneal treatment or a 
secondary recrystallization process may be used for 
the preparation of perfect strain-free single crystals. 

7.8.5 Annealing twins 

A prominent feature of the microstructures of most 
annealed fee metals and alloys is the presence of 
many straight- sided bands that run across grains. These 
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Critical Strain- 



Figure 7.57 Relation between grain size, deformation and 
temperature for aluminium ( after Buergers, courtesy of 
Akademie-Verlags-Gesellschaft). 

bands have a twinned orientation relative to their 
neighbouring grain and are referred to as annealing 
twins (see Chapter 4). The parallel boundaries usually 
coincide with a (111) twinning plane with the 
structure coherent across it, i.e. both parts of the twin 
hold a single (111) plane in common. 

As with formation of deformation twins, it is 
believed that a change in stacking sequence is all 
that is necessary to form an annealing twin. Such 
a change in stacking sequence may occur whenever 
a properly oriented grain boundary migrates. For 
example, if the boundary interface corresponds to a 
(111) plane, growth will proceed by the deposition of 
additional (111) planes in the usual stacking sequence 

ABCABC If, however, the next newly deposited 

layer falls into the wrong position, the sequence 
ABCABCB is produced which constitutes the first 
layer of a twin. Once a twin interface is formed, further 
growth may continue with the sequence in reverse 
order, ABCABC |BACB . . . until a second accident 
in the stacking sequence completes the twin band, 
ABCABCBACBACBABC. When a stacking error, 
such as that described above, occurs the number of 
nearest neighbours is unchanged, so that the ease of 
formation of a twin interface depends on the relative 
value of the interface energy. If this interface energy 
is low, as in copper where y tW i n < 20 mJ/m 2 twinning 
occurs frequently while if it is high, as in aluminium, 
the process is rare. 

Annealing twins are rarely (if ever) found in cast 
metals because grain boundary migration is negligible 
during casting. Worked and annealed metals show 
considerable twin band formation; after extensive grain 
growth a coarse-grained metal often contains twins 
which are many times wider than any grain that was 
present shortly after recrystallization. This indicates 
that twin bands grow in width, during grain growth, 
by migration in a direction perpendicular to the (111) 
composition plane, and one mechanism whereby this 
can occur is illustrated schematically in Figure 7.58. 
This shows that a twin may form at the corner of a 


Figure 7.58 Formation and growth of annealing twins ( from 
Burke and Turnbull, 1952; courtesy of P erg amon Press). 

grain, since the grain boundary configuration will then 
have a lower interfacial energy. If this happens the 
twin will then be able to grow in width because one of 
its sides forms part of the boundary of the growing 
grain. Such a twin will continue to grow in width 
until a second mistake in the positioning of the atomic 
layers terminates it; a complete twin band is then 
formed. In copper and its alloys Xtwin/Kgb is low and 
hence twins occur frequently, whereas in aluminium 
the corresponding ratio is very much higher and so 
twins are rare. 

Twins may develop according to the model shown in 
Figure 7.59 where during grain growth a grain contact 
is established between grains C and D. Then if the 
orientation of grain D is close to the twin orientation 
of grain C, the nucleation of an annealing twin at 
the grain boundary, as shown in Figure 7.60d, will 
lower the total boundary energy. This follows because 
the twin/D interface will be reduced to about 5% of 
the normal grain boundary energy, the energies of the 
C/A and twin/A interface will be approximately the 
same, and the extra area of interface C/twin has only a 
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Figure 7.59 Nucleation of an annealing twin during grain 
growth. 



Figure 7.60 Combination of transient and steady -state 
creep. 
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very low energy. This model indicates that the number 
of twins per unit grain boundary area only depends 
on the number of new grain contacts made during 
grain growth, irrespective of grain size and annealing 
temperature. 

7.8.6 Recrystallization textures 

The preferred orientation developed by cold work often 
changes on recrystallization to a totally different pre- 
ferred orientation. To explain this observation, Bar- 
rett and (later) Beck have put forward the ‘oriented 
growth’ theory of recrystallization textures in which 
it is proposed that nuclei of many orientations ini- 
tially form but, because the rate of growth of any 
given nucleus depends on the orientation difference 
between the matrix and growing crystal, the recrystal- 
lized texture will arise from those nuclei which have 
the fastest growth rate in the cold-worked matrix, i.e. 
those bounded by large-angle boundaries. It then fol- 
lows that because the matrix has a texture, all the 
nuclei which grow will have orientations that differ by 
30-40° from the cold-worked texture. This explains 
why the new texture in fee metals is often related to 
the old texture, by a rotation of approximately 30-40° 
around (111) axes, in bcc metals by 30° about (110) 
and in hep by 30° about (0001). However, while 
it is undoubtedly true that oriented growth provides 
a selection between favourable and unfavourable ori- 
ented nuclei, there are many observations to indicate 
that the initial nucleation is not entirely random. For 
instance, because of the crystallographic symmetry one 
would expect grains appearing in a fee texture to be 
related to rotations about all four (111) axes, i.e. eight 
orientations arising from two possible rotations about 
each of the four (111) axes. All these possible orien- 
tations are rarely (if ever) observed. 

To account for such observations, and for those 
cases where the deformation texture and the annealing 
texture show strong similarities, oriented nucleation 
is considered to be important. The oriented nucle- 
ation theory assumes that the selection of orientations 
is determined in the nucleation stage. It is generally 
accepted that all recrystallization nuclei pre-exist in the 
deformed matrix, as sub-grains, which become more 
perfect through recovery processes prior to recrystal- 
lization. It is thus most probable that there is some 
selection of nuclei determined by the representation of 
the orientations in the deformation texture, and that the 
oriented nucleation theory should apply in some cases. 
In many cases the orientations which are strongly rep- 
resented in the annealing texture are very weakly rep- 
resented in the deformed material. The most striking 
example is the ‘cube’ texture, (100) [001], found in 
most fee pure metals which have been annealed follow- 
ing heavy rolling reductions. In this texture, the cube 
axes are extremely well aligned along the sheet axes, 
and its behaviour resembles that of a single crystal. 
It is thus clear that cube-oriented grains or sub-grains 
must have a very high initial growth rate in order to 


form the remarkably strong quasi- single-crystal cube 
texture. The percentage of cubically aligned grains 
increases with increased deformation, but the sharp- 
ness of the textures is profoundly affected by alloying 
additions. The amount of alloying addition required 
to suppress the texture depends on those factors which 
affect the stacking fault energy, such as the lattice mis- 
fit of the solute atom in the solvent lattice, valency 
etc., in much the same way as that described for the 
transition of a pure metal deformation texture. 

In general, however, if the texture is to be altered 
a distribution of second-phase must either be present 
before cold rolling or be precipitated during anneal- 
ing. In aluminium, for example, the amount of cube 
texture can be limited in favour of retained rolling 
texture by limiting the amount of grain growth with 
a precipitate dispersion of Si and Fe. By balancing 
the components, earing can be minimized in drawn 
aluminium cups. In aluminium-killed steels AIN pre- 
cipitation prior to recrystallization produces a higher 
proportion of grains with {111} planes parallel to 
the rolling plane and a high R value suitable for 
deep drawing. The AIN dispersion affects sub-grain 
growth, limiting the available nuclei and increasing 
the orientation- selectivity, thereby favouring the high- 
energy {111} grains. Improved ^-values in steels in 
general are probably due to the combined effect of 
particles in homogenizing the deformed micro structure 
and in controlling the subsequent sub-grain growth. 
The overall effect is to limit the availability of nuclei 
with orientations other than {1 1 1}. 

7.9 Metallic creep 

7.9.1 Transient and steady-state creep 

Creep is the process by which plastic flow occurs 
when a constant stress is applied to a metal for a 
prolonged period of time. After the initial strain s 0 
which follows the application of the load, creep usually 
exhibits a rapid transient period of flow (stage 1) 
before it settles down to the linear steady- stage stage 
2 which eventually gives way to tertiary creep and 
fracture. Transient creep, sometimes referred to as /3- 
creep, obeys a t l/3 law. The linear stage of creep is 
often termed steady-state creep and obeys the relation 

e = Kt (7.49) 

Consequently, because both transient and steady- state 
creep usually occur together during creep at high 
temperatures, the complete curve (Figure 7.60) during 
the primary and secondary stages of creep fits the 
equation 

e = fit 1 ' 3 + Kt (7.50) 

extremely well. In contrast to transient creep, steady- 
state creep increases markedly with both temperature 
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and stress. At constant stress the dependence on tem- 
perature is given by 

£ ss = ds/dt = const, exp [— Q/kT] (7.51) 

where Q is the activation energy for steady-state creep, 
while at constant temperature the dependence on stress 
a (compensated for modulus E ) is 

s ss = const. (< a/E) n (7.52) 

Steady-state creep is therefore described by the 
equation 

£ SS = A(a/E) n exp (-Q/kT) (7.53) 

The basic assumption of the mechanism of steady- 
state creep is that during the creep process the rate 
of recovery r (i.e. decrease in strength (da/d t)) is 
sufficiently fast to balance the rate of work hardening 
h = (do/de). The creep rate (ds/dt) is then given by 

ds/dt = (da / dt) / (da / ds) = r/h (7.54) 

To prevent work hardening, both the screw and edge 
parts of a glissile dislocation loop must be able to 
escape from tangled or piled-up regions. The edge 
dislocations will, of course, escape by climb, and since 
this process requires a higher activation energy than 
cross-slip, it will be the rate-controlling process in 
steady-state creep. The rate of recovery is governed 
by the rate of climb, which depends on diffusion and 
stress such that 

r = A(a/E) P D = A(a/E) P D 0 exp [-Q/kT] 

where D is a diffusion coefficient and the stress term 
arises because recovery is faster, the higher the stress 
level and the closer dislocations are together. The 
work-hardening rate decreases from the initial rate ho 
with increasing stress, i.e. h = ho(E/o) q , thus 

£ SS = r/h = B(a/E) n D (7.55) 

where B(= A/ ho) is a constant and n(= p + q) is the 
stress exponent. 

The structure developed in creep arises from the 
simultaneous work-hardening and recovery. The dis- 
location density p increases with s and the disloca- 
tion network gets finer, since dislocation spacing is 
proportional to p -1/2 . At the same time, the dislo- 
cations tend to reduce their strain energy by mutual 
annihilation and rearrange to form low- angle bound- 
aries and this increases the network spacing. Straining 
then proceeds at a rate at which the refining action 
just balances the growth of the network by recov- 
ery; the equilibrium network size being determined by 
the stress. Although dynamical recovery can occur by 
cross-slip, the rate-controlling process in steady-state 
creep is climb whereby edge dislocations climb out of 
their glide planes by absorbing or emitting vacancies; 
the activation energy is therefore that of self-diffusion. 
Structural observations confirm the importance of the 
recovery process to steady-state creep. These show that 
sub-grains form within the original grains and, with 


increasing deformation, the sub-grain angle increases 
while the dislocation density within them remains con- 
stant. 1 The climb process may, of course, be important 
in several different ways. Thus, climb may help a glis- 
sile dislocation to circumvent different barriers in the 
structure such as a sessile dislocation, or it may lead to 
the annihilation of dislocations of opposite sign on dif- 
ferent glide planes. Moreover, because creep-resistant 
materials are rarely pure metals, the climb process may 
also be important in allowing a glissile dislocation to 
get round a precipitate or move along a grain bound- 
ary. A comprehensive analysis of steady- state creep, 
based on the climb of dislocations, has been given by 
Weertman. 

The activation energy for creep Q may be obtained 
experimentally by plotting In £ ss versus 1/T, as shown 
in Figure 7.61. Usually above 0.5 T m , Q corresponds 
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Figure 7.61 Log s versus 1/T for (a) M-AI 2 Os, 

(b) Ni-67Co-Al2 0s, showing the variation in activation 
energy above and below 0.5 T m (after Hancock, Dillamore 
and Smallman, 1972). 

1 Sub-grains do not always form during creep and in some 
metallic solid solutions where the glide of dislocations is 
restrained due to the dragging of solute atoms, the 
steady-state substructure is essentially a uniform distribution 
of dislocations. 
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Figure 7.62 Variation in activation energy Q with 
temperature for aluminium. 
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Figure 7.63 Log e versus log crfor (a) Ni-Al 2 Os, 

(b) Ni-67 C 0 -AI 2 Os ( after Hancock, Dillamore and 
Smallman, 1972). 

to the activation energy for self-diffusion E S d, in 
agreement with the climb theory, but below 0.5T m , 
Q < E S d , possibly corresponding to pipe diffusion. 
Figure 7.62 shows that three creep regimes may be 
identified and the temperature range where Q = E S d 
can be moved to higher temperatures by increasing 
the strain rate. Equation (7.55) shows that the stress 
exponent n can be obtained experimentally by plot- 
ting In £ ss versus In <r, as shown in Figure 7.63, where 
n ~ 4. While n is generally about 4 for dislocation 
creep, Figure 7.64 shows that n may vary consider- 
ably from this value depending on the stress regime; 
at low stresses (i.e. regime I) creep occurs not by dis- 
location glide and climb but by stress-directed flow of 
vacancies. 

7.9.2 Grain boundary contribution to creep 

In the creep of polycrystals at high temperatures the 
grain boundaries themselves are able to play an impor- 
tant part in the deformation process due to the fact that 



Figure 7.64 Schematic diagram showing influence of stress 
on diffusion-compensated steady-state creep. 

they may (1) slide past each other or (2) create vacan- 
cies. Both processes involve an activation energy for 
diffusion and therefore may contribute to steady- state 
creep. 

Grain boundary sliding during creep was inferred 
initially from the observation of steps at the 
boundaries, but the mechanism of sliding can be 
demonstrated on bi-crystals. Figure 7.65 shows a good 
example of grain boundary movement in a bi-crystal 
of tin, where the displacement of the straight grain 
boundary across its middle is indicated by marker 
scratches. Grain boundaries, even when specially 
produced for bi-crystal experiments, are not perfectly 
straight, and after a small amount of sliding at the 
boundary interface, movement will be arrested by 
protuberances. The grains are then locked, and the 
rate of slip will be determined by the rate of plastic 
flow in the protuberances. As a result, the rate of 
slip along a grain boundary is not constant with 
time, because the dislocations first form into piled- 
up groups, and later these become relaxed. Local 
relaxation may be envisaged as a process in which 
the dislocations in the pile-up climb towards the 
boundary. In consequence, the activation energy for 
grain boundary slip may be identified with that for 
steady-state creep. After climb, the dislocations are 
spread more evenly along the boundary, and are thus 
able to give rise to grain boundary migration, when 
sliding has temporarily ceased, which is proportional 
to the overall deformation. 

A second creep process which also involves the 
grain boundaries is one in which the boundary acts 
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Figure 7.65 Grain boundary sliding on a bi-crystal tin ( after Puttick and King, 1952). 


o 




Figure 7.66 Schematic representation of Herring-Nabarro creep; 
longitudinal faces (a) to produce creep as shown in (b). 


as a source and sink for vacancies. The mechanism 
depends on the migration of vacancies from one side 
of a grain to another, as shown in Figure 7.66, and 
is often termed Herring-Nabarro creep, after the two 
workers who originally considered this process. If, in 
a grain of sides d under a stress a, the atoms are 
transported from faces BC and AD to the faces AB and 
DC the grain creeps in the direction of the stress. To 
transport atoms in this way involves creating vacancies 
on the tensile faces AB and DC and destroying them 
on the other compressive faces by diffusion along the 
paths shown. 


with cj > c c vacancies flow from the tensile faces to the 


On a tensile face AB the stress exerts a force 
a b 2 (or oT2 2/3 ) on each surface atom and so does 
work ab 2 x b each time an atom moves forward 
one atomic spacing b (or Q l/3 ) to create a vacancy. 
The energy of vacancy formation at such a face is 
thus reduced to (E f — ab 3 ) and the concentration of 
vacancies in equilibrium correspondingly increased 
to c x — exp [(— Ei + ab 3 )/kT] = c 0 exp ab 3 /kT). The 
vacancy concentration on the compressive faces will 
be reduced to c c = c 0 exp (— ab 3 /kT). Vacancies will 
therefore flow down the concentration gradient, and 
the number crossing a face under tension to one under 
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compression will be given by Fick’s law as 

</> = — Z) v J 2 (c t — c c )/ad 

where D v is the vacancy diffusivity and a relates 
to the diffusion length. Substituting for c T , c c and 
D = (DyCob 3 ) leads to 

(p = 2dD sinh(ab 3 /kT) / ab 3 

Each vacancy created on one face and annihilated on 
the other produces a strain s = b 3 /d 3 , so that the creep 
strain-rate s = (p{b 3 /d 3 ). At high temperatures and low 
stresses this reduces to 

£ H -n = 2Dab 3 /ad 2 kT = B H _ N DaQ/d 2 kT (7.56) 

where the constant 5 H -n ~ 10. 

In contrast to dislocation creep, Herring -Nabarro 
creep varies linearly with stress and occurs at T ~ 
0.8F m with o & 10 6 N/m 2 . The temperature range over 
which vacancy-diffusion creep is significant can be 
extended to much lower temperatures (i.e. T ~ 0.5 T m ) 
if the vacancies flow down the grain boundaries rather 
than through the grains. Equation (7.56) is then mod- 
ified for Coble or grain boundary diffusion creep, and 
is given by 

^cobie = B c D gh aQco/kTd 3 (7.57) 

where co is the width of the grain boundary. Under such 
conditions (i.e. T ~ 0.5 to 0.6T m and low stresses) dif- 
fusion creep becomes an important creep mechanism in 
a number of high-technology situations, and has been 
clearly identified in magnesium-based canning materi- 
als used in gas-cooled reactors. 

7.9.3 Tertiary creep and fracture 

Tertiary creep and fracture are logically considered 
together, since the accelerating stage represents the 
initiation of conditions which lead to fracture. In 
many cases the onset of accelerating creep is an 
indication that voids or cracks are slowly but continu- 
ously forming in the material, and this has been con- 
firmed by metallography and density measurements. 
The type of fracture resulting from tertiary creep is not 
transcrystalline but grain boundary fracture. Two types 
of grain boundary fracture have been observed. The 
first occurs principally at the triple point formed where 
three grain boundaries meet, and sliding along bound- 
aries on which there is a shear stress produces stress 
concentrations at the point of conjunction sufficiently 
high to start cracks. However, under conditions of slow 
strain rate for long times, which would be expected to 
favour recovery, small holes form on grain boundaries, 
especially those perpendicular to the tensile axis, and 
these gradually grow and coalesce. 

Second-phase particles play an important part in 
the nucleation of cracks and cavities by concentrating 
stress in sliding boundaries and at the intersection of 
slip bands with particles but these stress concentrations 


are greatly reduced by plastic deformation by power- 
law creep and by diffusional processes. Cavity forma- 
tion and early growth is therefore intimately linked to 
the creep process itself and the time-to-fracture cor- 
relates well with the minimum creep rate for many 
structural materials. Fracture occurs when the larger, 
more closely spaced cavities coalesce. Creep fracture 
is discussed further in Chapter 8. 

7.9.4 Creep-resistant alloy design 

The problem of the design of engineering creep- 
resistant alloys is complex, and the optimum alloy for 
a given service usually contains several constituents 
in various states of solution and precipitation. Never- 
theless, it is worth considering some of the principles 
underlying creep-resistant behaviour in the light of the 
preceding theories. 

First, let us consider the strengthening of the solid 
solution by those mechanisms which cause disloca- 
tion locking and those which contribute to lattice 
friction hardening. The former include solute atoms 
interacting with (1) the dislocation or (2) the stacking 
fault. Friction hardening can arise from (1) the stress 
fields around individual atoms (i.e. the Mott- Nabarro 
effect), (2) clusters of solute atoms in solid solutions, 
(3) by increasing the separation of partial dislocations 
and so making climb, cross-slip and intersection more 
difficult, (4) by the solute atoms becoming attached 
to jogs and thereby impeding climb, and (5) by influ- 
encing the energies of formation and migration of 
vacancies. The alloy can also be hardened by precipi- 
tation, and it is significant that many of the successful 
industrial creep-resistant alloys are of this type (e.g. the 
nickel alloys, and both ferritic and austenitic steels). 

The effectiveness of these various methods of con- 
ferring strength on the alloy will depend on the con- 
ditions of temperature and stress during creep. All 
the effects should play some part during fast primary 
creep, but during the slow secondary creep stage the 
impeding of dislocation movement by solute locking 
effects will probably be small. This is because modern 
creep-resistant alloys are in service up to tempera- 
tures of about two-thirds the absolute melting point 
(T /T m ~ |) of the parent metal, whereas above about 
T/T m ~ ^ solute atoms will migrate as fast as dis- 
locations. Hardening which relies on clusters will be 
more difficult to remove than that which relies upon 
single atoms and should be effective up to higher tem- 
peratures. However, for any hardening mechanism to 
be really effective, whether it is due to solute atom 
clusters or actual precipitation, the rate of climb and 
cross-slip past the barriers must be slow. Accordingly, 
the most probable role of solute alloying elements in 
modern creep-resistant alloys is in reducing the rate 
of climb and cross-slip processes. The three hardening 
mechanisms listed as 3, 4 and 5 above are all effective 
in this way. From this point of view, it is clear that 
the best parent metals on which to base creep-resistant 
alloys will be those in which climb and cross-slip 
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is difficult; these include the fee and eph metals of 
low stacking-fault energy, for which the slip disloca- 
tions readily dissociate. Generally, the creep rate is 
described by the empirical relation 

s = A(a/E) n (y) m D (7.58) 

where A is a constant, n , m stress and fault energy 
exponents, respectively, and D the diffusivity; for 
fee materials m ~ 3 and n ~ 4. The reason for the 
good creep strength of austenitic and Ni-base materials 
containing Co, Cr, etc. arises from their low fault 
energy and also because of their relatively high melting 
point when D is small. 

From the above discussion it appears that a suc- 
cessful creep-resistant material would be an alloy, the 
composition of which gives a structure with a hardened 
solid- solution matrix containing a sufficient number 
of precipitated particles to force glissile partial dis- 
locations either to climb or to cross-slip to circum- 
vent them. The constitution of the Nimonic alloys, 
which consist of a nickel matrix containing dissolved 
chromium, titanium, aluminium and cobalt, is in accor- 
dance with these principles, and since no large atomic 
size factors are involved it appears that one of the func- 
tions of these additions is to lower the stacking-fault 
energy and thus widen the separation of the partial 
dislocations. A second object of the titanium and alu- 
minium alloy additions 1 is to produce precipitation, 
and in the Nimonic alloys much of the precipitate is 
Ni 3 Al. This precipitate is isomorphous with the matrix, 
and while it has a parameter difference (~^%) small 
enough to give a low interfacial energy, it is, neverthe- 
less, sufficiently large to give a source of hardening. 
Thus, since the energy of the interface provides the 
driving force for particle growth, this low-energy inter- 
face between particle and matrix ensures a low rate of 
particle growth and hence a high service temperature. 

Grain boundary precipitation is advantageous in 
reducing grain boundary sliding. Alternatively, the 
weakness of the grain boundaries may be eliminated 
altogether by using single-crystal material. Nimonic 
alloys used for turbine blades have been manufactured 
in single-crystal form by directional solidification (see 
Chapters 3 and 10). 

Dispersions are effective in conferring creep 
strength by two mechanisms. First the particle will 

! The chromium forms a spinel with NiO and hence 
improves the oxidation resistance. 


hinder a dislocation and force it to climb and cross- 
slip. Second, and more important, is the retarding 
effect on recovery as shown by some dispersions, 
Cu-A 1 2 0 3 (extruded), SAP (sintered alumina powder), 
and Ni-Th0 2 which retain their hardness almost to 
the melting point. A comparison of SAP with a 
‘conventional’ complex aluminium alloy shows that 
at 250° C there is little to choose between them but 
at 400° C SAP is several times stronger. Generally, 
the dislocation network formed by strain-hardening 
interconnects the particles and is thereby anchored 
by them. To do this effectively, the particle must be 
stable at the service temperature and remain finely 
dispersed. This depends on the solubility C, diffusion 
coefficient D and interfacial energy y \ , since the 
time to dissolve the particle is t — r 4 kT /DC Y \R 2 . In 
precipitation-hardening alloys, C is appreciable, and D 
offers little scope for adjustment; great importance is 
therefore placed on y\ as for the Ni 3 (TiAl) phase in 
Nimonic s where it is very low. 

Figure 7.63 shows that n ~ 4 both above and below 
0.5 T m for the Ni-Al 2 0 3 and Ni-Co-Al 2 0 3 alloys 
that were completely recrystallized, which contrasts 
with values very much greater than 4 for extruded TD 
nickel and other dispersion- strengthened alloys 1 con- 
taining a dislocation substructure. This demonstrates 
the importance of substructure and probably indicates 
that in completely recrystallized alloys containing a 
dispersoid, the particles control the creep behaviour, 
whereas in alloys containing a substructure the dis- 
location content is more important. Since n ~ 4 for 
the Ni- and Ni-Co-Al 2 0 3 alloys in both tempera- 
ture regimes, the operative deformation mechanism is 
likely to be the same, but it is clear from the activation 
energies, listed in Table 7.4, that the rate-controlling 
thermally activated process changes with temperature. 
The activation energy is greater at the higher tem- 
perature when it is also, surprisingly, composition (or 
stacking-fault energy) independent. 

Such behaviour may be explained, if it is assumed 
that the particles are bypassed by cross-slip (see 
Chapter 8) and this process is easy at all temperatures, 
but it is the climb of the edge segments of the cross- 
slipped dislocations that is rate-controlling. At low 
temperatures, climb would proceed by pipe-diffusion 

x To analyse these it is generally necessary to introduce a 
threshold (or friction) stress cro, so that the effective stress 
is (a - cr 0 ). 


Table 7.4 Experimentally-determined parameters from creep of M-AI 2 O 3 and Ni-Co-Al 2 03 alloys 


Test 

temperature 


773 K 


1000K 


Q(kJ mol 

) A(s -‘ ) 

QikJ mol 1 ) 

Ms- 1 ) 

A/D 0 

Alloy 

Ni 

85 

1.67 x 10 16 

276 

1.1 x 10 28 

5.5 x 10 28 

Ni-67% Co 

121 

9.95 x 10 19 

276 

2.2 x 10 28 

5.8 x 10 28 
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Figure 7.67 Schematic diagram showing the distribution of second-phase particles before and after diffusion creep. 


so that the composition-dependence relates to the vari- 
ation in the ease of pipe-diffusion along dislocations 
of different widths. At high temperatures, climb occurs 
by bulk diffusion and the absence of any composition- 
dependence is due to the fact that in these alloys 
the jog distribution is determined mainly by disloca- 
tion/particle interactions and not, as in single-phase 
alloys and in dispersion- strengthened alloys containing 
a substructure, by the matrix stacking-fault energy. The 
optimum creep resistance of dispersion- strengthened 
alloys is produced when a uniform dislocation net- 
work in a fibrous grain structure is anchored by the 
particles and recovery is minimized. Such a structure 
can reduce the creep rate by several orders of mag- 
nitude from that given in Figure 7.63, but it depends 
critically upon the working and heat-treatment used in 
fabricating the alloy. 

Second-phase particles can also inhibit diffusion 
creep. Figure 7.67 shows the distribution of particles 
before and after diffusion creep and indicates that 
the longitudinal boundaries tend to collect precipi- 
tates as vacancies are absorbed and the boundaries 
migrate inwards, while the tensile boundaries acquire 


a PFZ. Such a structural change has been observed in 
Mg-0.5%Zr ( Magnox ZR55 ) at 400°C and is accom- 
panied by a reduced creep rate. It is not anticipated 
that diffusion is significantly affected by the presence 
of particles and hence the effect is thought to be due to 
the particles affecting the vacancy-absorbing capabil- 
ities of the grain boundaries. Whatever mechanism is 
envisaged for the annihilation of vacancies at a grain 
boundary, the climb-glide of grain boundary disloca- 
tions is likely to be involved and such a process will 
be hindered by the presence of particles. 


7.10 Deformation mechanism maps 

The discussion in this chapter has emphasized that 
over a range of stress and temperature an alloy is 
capable of deforming by several alternative and inde- 
pendent mechanisms, e.g. dislocation creep with either 
pipe diffusion at low temperatures and lattice diffusion 
at high temperatures being the rate-controlling mecha- 
nism, and diffusional creep with either grain-boundary 



252 Modern Physical Metallurgy and Materials Engineering 


(a) temperature °C (b) 

0 500 1000 



Figure 7.68 Deformation-mechanism maps for (a) nickel, (h) nickel-based superalloy (after M. F. Ashby). 


diffusion or lattice diffusion being important. In a par- 
ticular range of temperature, one of these mechanisms 
is dominant and it is therefore useful in engineering 
application to identify the operative mechanism for a 
given stress -temperature condition, since it is ineffec- 
tive to change the metallurgical factors to influence, for 
example, a component deforming by power-law creep 
controlled by pipe diffusion if the operative mechanism 
is one of Herring -Nabarro creep. 

The various alternative mechanisms are displayed 
conveniently on a deformation-mechanism map in 
which the appropriate stress, i.e. shear stress or equiv- 
alent stress, compensated by modulus on a log scale, 
is plotted against homologous temperature T /T m as 
shown in Figure 7.68 for nickel and a nickel-based 
superalloy with a grain size of 100 pm. By comparing 
the diagrams it is evident that solid solution strength- 
ening and precipitation-hardening have raised the yield 
stress and reduced the dislocation creep field. The 
shaded boxes shown in Figure 7.68 indicate the typi- 
cal stresses and temperatures to which a turbine blade 
would be subjected; it is evident that the mechanism 
of creep during operation has changed and, indeed, the 
creep rate is reduced by several orders of magnitude. 

7.11 Metallic fatigue 

7.11.1 Nature of fatigue failure 

The term fatigue applies to the behaviour of a metal 
which, when subjected to a cyclically variable stress 
of sufficient magnitude (often below the yield stress) 
produces a detectable change in mechanical properties. 
In practice, a large number of service failures are due 
to fatigue, and so engineers are concerned mainly with 
fatigue failure where the specimen is actually separated 
into two parts. Some of these failures can be attributed 


to poor design of the component, but in some can 
be ascribed to the condition of the material. Conse- 
quently, the treatment of fatigue may be conveniently 
divided into three aspects: (1) engineering consider- 
ations, (2) gross metallurgical aspects, and (3) fine- 
scale structural and atomic changes. 

The fatigue conditions which occur in service 
are usually extremely complex. Common failures 
are found in axles where the eccentric load at a 
wheel or pulley produces a varying stress which is 
a maximum in the skin of the axle. Other examples, 
such as the flexure stresses produced in aircraft wings 
and in undercarriages during ground taxi-ing, do, 
however, emphasize that the stress system does not 
necessarily vary in a regular sinusoidal manner. The 
series of aircraft disasters attributed to pressurized- 
cabin failures is perhaps the most spectacular example 
of this type of fatigue failure. 

7.11.2 Engineering aspects of fatigue 

In laboratory testing of materials the stress system is 
usually simplified, and both the Woehler and push-pull 
type of test are in common use. The results are usually 
plotted on the familiar S-N curve (i.e. stress versus 
the number of cycles to failure, usually plotted on a 
logarithmic scale). Ferritic steels may be considered 
to exhibit a genuine fatigue limit with a fatigue ratio 
S/TS ~ 0.5. However, other materials, such as alu- 
minium or copper-based alloys, certainly those of the 
age-hardening variety, definitely do not show a sharp 
discontinuity in the S-N curve. For these materials 
no fatigue limit exists and all that can be specified is 
the endurance limit at N cycles. The importance of 
the effect is illustrated by the behaviour of commer- 
cial aluminium-based alloys containing zinc, magne- 
sium and copper. Such an alloy may have a TS of 
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617 MN/m 2 but the fatigue stress for a life of 10 8 
cycles is only 154 MN/m 2 (i.e. a fatigue ratio at 10 8 
cycles of 0.25). 

The amplitude of the stress cycle to which the spec- 
imen is subjected is the most important single variable 
in determining its life under fatigue conditions, but the 
performance of a material is also greatly affected by 
various other conditions, which may be summarized 
as follows: 

1. Surface preparation Since fatigue cracks frequently 
start at or near the surface of the component, the 
surface condition is an important consideration in 
fatigue life. The removal of machining marks and 
other surface irregularities invariably improves the 
fatigue properties. Putting the surface layers under 
compression by shot peening or surface treatment 
improves the fatigue life. 

2. Effect of temperature Temperature affects the fatigue 
properties in much the same way as it does the ten- 
sile strength (TS); the fatigue strength is highest at 
low temperatures and decreases gradually with ris- 
ing temperature. For mild steel the ratio of fatigue 
limit to TS remains fairly constant at about 0.5, 
while the ratio of fatigue limit to yield stress varies 
over much wider limits. However, if the tempera- 
ture is increased above about 100°C, both the tensile 
strength and the fatigue strength of mild steel show 
an increase, reaching a maximum value between 
200° C and 400° C. This increase, which is not com- 
monly found in other materials, has been attributed 
to strain- ageing. 

3. Frequency of stress cycle In most metals the fre- 
quency of the stress cycle has little effect on the 
fatigue life, although lowering the frequency usually 
results in a slightly reduced fatigue life. The effect 
becomes greater if the temperature of the fatigue 
test is raised, when the fatigue life tends to depend 
on the total time of testing rather than on the num- 
ber of cycles. With mild steel, however, experiments 



show that the normal speed effect is reversed in a 
certain temperature range and the number of cycles 
to failure increases with decrease in the frequency 
of the stress cycle. This effect may be correlated 
with the influence of temperature and strain-rate 
on the TS. The temperature at which the tensile 
strength reaches a maximum depends on the rate 
of strain, and it is, therefore, not surprising that the 
temperature at which the fatigue strength reaches a 
maximum depends on the cyclic frequency. 

4. Mean stress For conditions of fatigue where the 
mean stress, i.e. 

A oN\ = (cr max + <7 min )/2 

does not exceed the yield stress a y , then the rela- 
tionship 

AaNf = const. (7.59) 

known as Basquin’s law, holds over the range 10 2 
to ~10 5 cycles, i.e. N less than the knee of the S-N 
curve, where a ~ ^ and Nf the number of cycles 
to failure. For low cycle fatigue with A a > a y then 
Basquin’s law no longer holds, but a reasonable 
relationship 

Ae p ./Vf = D b = const. (7.60) 

known as the Coffin -Manson law, is found where 
Ae v is the plastic strain range, b ~ 0.6, and D is the 
ductility of the material. If the mean stress becomes 
tensile a lowering of the fatigue limit results. Several 
relationships between fatigue limit and mean stress 
have been suggested, as illustrated in Figure 7.69a. 
However, there is no theoretical reason why a mate- 
rial should follow any given relationship and the 
only safe rule on which to base design is to carry out 
prior tests on the material concerned to determine its 
behaviour under conditions similar to those it will 
meet in service. Another common engineering rela- 
tionship frequently used, known as Miner’ s concept 



^al 

Figure 7.69 Fatigue relationships. 
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of cumulative damage, is illustrated in Figure 7.69b. 
This hypothesis states that damage can be expressed 
in terms of the number of cycles applied divided by 
the number to produce failure at a given stress level. 
Thus, if a maximum stress of value Si is applied to a 
specimen for n i cycles which is less than the fatigue 
life Ni, and then the maximum stress is reduced to 
a value equal to S 2 , the specimen is expected to 
fail after n 2 cycles, since according to Miner the 
following relationship will hold 

m/Ni + n 2 /N 2 + . . . = T,n/N = 1 (7.61) 

5. Environment Fatigue occurring in a corrosive envi- 
ronment is usually referred to as corrosion fatigue. 
It is well known that corrosive attack by a liquid 
medium can produce etch pits which may act as 
notches, but when the corrosive attack is simultane- 
ous with fatigue stressing, the detrimental effect is 
far greater than just a notch effect. Moreover, from 
microscopic observations the environment appears 
to have a greater effect on crack propagation than 
on crack initiation. For most materials even atmo- 
spheric oxygen decreases the fatigue life by influ- 
encing the speed of crack propagation, and it is 
possible to obtain a relationship between fatigue life 
and the degree of vacuum in which the specimen has 
been held. 

It is now well established that fatigue starts at the 
surface of the specimen. This is easy to understand 
in the Woehler test because, in this test, it is there 
that the stress is highest. However, even in push-pull 
fatigue, the surface is important for several reasons: 
(1) slip is easier at the surface than in the interior 
of the grains, (2) the environment is in contact with 
the surface, and (3) any specimen misalignment will 
always give higher stresses at the surface. Accordingly, 
any alteration in surface properties must bring about 
a change in the fatigue properties. The best fatigue 
resistance occurs in materials with a worked surface 
layer produced by polishing with emery, shot-peening 
or skin-rolling the surface. This beneficial effect of a 
worked surface layer is principally due to the fact that 
the surface is put into compression, but the increased 
TS as a result of work hardening also plays a part. 
Electropolishing the specimen by removing the sur- 
face layers usually has a detrimental effect on the 
fatigue properties, but other common surface prepara- 
tions such as nitriding and carburizing, both of which 
produce a surface layer which is in compression, may 
be beneficial. Conversely, such surface treatments as 
the decarburizing of steels and the cladding of alu- 
minium alloys with pure aluminium, increase their 
susceptibility to fatigue. 

The alloy composition and thermal and mechanical 
history of the specimen are also of importance in the 
fatigue process. Any treatment which increases the 
hardness or yield strength of the material will increase 
the level of the stress needed to produce slip and, as 
we shall see later, since the fundamental processes 


of fatigue are largely associated with slip, this leads 
directly to an increase in fatigue strength. It is also 
clear that grain size is a relevant factor: the smaller 
the grain size, the higher is the fatigue strength at a 
given temperature. 

The fatigue processes in stable alloys are essen- 
tially the same as those of pure metals but there is, of 
course, an increase in fatigue strength. However, the 
processes in unstable alloys and in materials exhibit- 
ing a yield point are somewhat different. In fatigue, 
as in creep, structural instability frequently leads to 
enhancement of the fundamental processes. In all cases 
the approach to equilibrium is more complete, so 
that in age-hardening materials, solution-treated speci- 
mens become harder and fully aged specimens become 
softer. The changes which occur are local rather than 
general, and are associated with the enhanced diffusion 
brought about by the production of vacancies during 
the fatigue test. Clearly, since vacancy mobility is a 
thermally activated process such effects can be sup- 
pressed at sufficiently low temperatures. 

In general, non-ferrous alloys do not exhibit the type 
of fatigue limit shown by mild steel. One exception 
to this generalization is the alloy aluminium 2-7% 
magnesium, 0.5% manganese, and it is interesting to 
note that this alloy also has a sharp yield point and 
shows Liiders markings in an ordinary tensile test. 
Accordingly, it has been suggested that the fatigue 
limit occupies a similar place in the field of alternat- 
ing stresses to that filled by the yield point in uni- 
directional stressing. Stresses above the fatigue limit 
readily unlock the dislocations from their solute atom 
atmospheres, while below the fatigue limit most dis- 
locations remain locked. In support of this view, it 
is found that when the carbon and nitrogen content 
of mild steel is reduced, by annealing in wet hydro- 
gen, striking changes take place in the fatigue limit 
(Figure 7.5) as well as in the sharp yield point. 

7.11.3 Structural changes accompanying 
fatigue 

Observations of the structural details underlying 
fatigue hardening show that in polycrystals large 
variations in slip-band distributions and the amount of 
lattice misorientation exist from one grain to another. 
Because of such variations it is difficult to typify 
structural changes, so that in recent years this structural 
work has been carried out more and more on single 
crystals; in particular, copper has received considerable 
attention as being representative of a typical metal. 
Such studies have now established that fatigue occurs 
as a result of slip, the direction of which changes 
with the stress cycle, and that the process continues 
throughout the whole of the test (shown, for example, 
by interrupting a test and removing the slip bands by 
polishing; the bands reappear on subsequent testing). 

Moreover, four stages in the fatigue life of a 
specimen are distinguishable; these may be summa- 
rized as follows. In the early stages of the test, the 
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whole of the specimen hardens. After about 5% of the 
life, slip becomes localized and persistent slip bands 
appear; they are termed persistent because they reap- 
pear and are not permanently removed by electropol- 
ishing. Thus, reverse slip does not continue throughout 
the whole test in the bulk of the metal (the matrix). 
Electron microscope observations show that metal is 
extruded from the slip bands and that fine crevices 
called intrusions are formed within the band. Dur- 
ing the third stage of the fatigue life the slip bands 
grow laterally and become wider, and at the same time 
cracks develop in them. These cracks spread initially 
along slip bands, but in the later stages of fracture the 
propagation of the crack is often not confined to cer- 
tain crystallographic directions and catastrophic rup- 
ture occurs. These two important crack growth stages, 
i.e. stage I in the slip band and stage II roughly perpen- 
dicular to the principal stress, are shown in Figure 7.70 
and are influenced by the formation of localized (per- 
sistent) slip bands (i.e. PSBs). However, PSBs are not 
clearly defined in low stacking fault energy, solid solu- 
tion alloys. 

Cyclic stressing therefore produces plastic deforma- 
tion which is not fully reversible and the build-up of 
dislocation density within grains gives rise to fatigue 
hardening with an associated structure which is char- 
acteristic of the strain amplitude and the ability of the 
dislocations to cross-slip, i.e. temperature and SFE. 
The non-reversible flow at the surface leads to intru- 
sions, extrusions and crack formation in PSBs. These 
two aspects will now be considered separately and in 
greater detail. 

Fatigue hardening If a single or poly crystalline 
specimen is subjected to many cycles of alternat- 
ing stress, it becomes harder than a similar specimen 
extended uni-directionally by the same stress applied 
only once. This may be demonstrated by stopping 
the fatigue test and performing a static tensile test 
on the specimen when, as shown in Figure 7.71, the 
yield stress is increased. During the process, persis- 
tent slip bands appear on the surface of the specimen 
and it is in such bands that cracks eventually form. 
The behaviour of a fatigue-hardened specimen has 



Figure 7.70 Persistent slip band (P SB) formation in fatigue, 
and stage I and stage II crack growth. 

two unusual features when compared with an ordi- 
nary work-hardened material. The fatigue-hardened 
material, having been stressed symmetrically, has the 
same yield stress in compression as in tension, whereas 
the work-hardened specimen (e.g. prestrained in ten- 
sion) exhibits a Bauschinger effect, i.e. weaker in com- 
pression than tension. It arises from the fact that the 
obstacles behind the dislocation are weaker than those 
resisting further dislocation motion, and the pile-up 
stress causes it to slip back under a reduced load in 
the reverse direction. The other important feature is 


(\j 



Figure 7.71 Stress-strain curves for copper after increasing amounts of fatigue testing (after Broom and Ham, 1959). 
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that the temperature-dependence of the hardening pro- 
duced by fatigue is significantly greater than that of 
work-hardening and, because of the similarity with 
the behaviour of metals hardened by quenching and 
by irradiation, it has been attributed to the effect of 
vacancies and dislocation loops created during fatigue. 

At the start of cyclic deformation the initial slip 
bands (Figure 7.72a) consist largely of primary dis- 
locations in the form of dipole and multipole arrays; 
the number of loops is relatively small because the 
frequency of cross-slip is low. As the specimen work- 
hardens slip takes place between the initial slip bands, 
and the new slip bands contain successively more 
secondary dislocations because of the internal stress 
arising from nearby slip bands (Figure 7.72b). When 
the specimen is completely filled with slip bands, the 
specimen has work-hardened and the softest regions 
are now those where slip occurred originally since 
these bands contain the lowest density of secondary 
dislocations. Further slip and the development of PSBs 
takes place within these original slip bands, as shown 
schematically in Figure 7.72c 

As illustrated schematically in Figure 7.73, TEM of 
copper crystals shows that the main difference between 
the matrix and the PSBs is that in the matrix the dense 
arrays of edge dislocation (di- and multipoles) are in 
the form of large veins occupying about 50% of the 
volume, whereas they form a ‘ladder’ -type structure 
within walls occupying about 10% of the volume in 
PSBs. The PSBs are the active regions in the fatigue 
process while the matrix is associated with the inactive 
parts of the specimen between the PSBs. Steady - 
state deformation then takes place by the to-and-fro 
glide of the same dislocations in the matrix, whereas 
an equilibrium between dislocation multiplication and 
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Figure 7.73 Schematic diagram showing (a) vein structure 
of matrix and (b) ladder structure of PSBs. 


annihilation exists in the PSBs. Multiplication occurs 
by bowing-out of the walls and annihilation takes place 
by interaction with edge dislocations of opposite sign 
(~75b apart) on glide planes in the walls and of screw 
dislocations (~200b apart) on glide planes in the low- 
dislocation channels, the exact distance depending on 
the ease of cross-slip. 

7.11.4 Crack formation and fatigue failure 

Extrusions, intrusions and fatigue cracks can be formed 
at temperatures as low as 4 K where thermally acti- 
vated movement of vacancies does not take place. 
Such observations indicate that the formation of intru- 
sions and cracks cannot depend on either chemical or 
thermal action and the mechanism must be a purely 
geometrical process which depends on cyclic stressing. 

Two general mechanisms have been suggested. The 
first, the Cottrell ‘ratchet’ mechanism, involves the 
use of two different slip systems with different direc- 
tions and planes of slip, as is shown schematically 
in Figure 7.74. The most favoured source (e.g. S i in 
Figure 7.74a) produces a slip step on the surface at P 
during a tensile half-cycle. At a slightly greater stress 
in the same half-cycle, the second source S 2 produces 



Figure 7.72 Formation of persistent slip bands (PSBs) 
during fatigue. 


Figure 7.74 Formation of intrusions and extrusions ( after 
Cottrell; courtesy of John Wiley and Sons). 
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a second step at Q (Figure 7.74b). During the com- 
pression half-cycle, the source S\ produces a surface 
step of opposite sign at P' (Figure 7.74c), but, owing 
to the displacing action of S 2 , this is not in the same 
plane as the first and thus an intrusion is formed. The 
subsequent operation of S 2 produces an extrusion at 
QQ (Figure 7.74d) in a similar manner. Such a mech- 
anism requires the operation of two slip systems and, 
in general, predicts the occurrence of intrusions and 
extrusions with comparable frequency, but not in the 
same slip band. 

The second mechanism, proposed by Mott, involves 
cross-slip resulting in a column of metal extruded from 
the surface and a cavity is left behind in the interior 
of the crystal. One way in which this could happen 
is by the cyclic movement of a screw dislocation 
along a closed circuit of crystallographic planes, as 
shown in Figure 7.75. During the first half-cycle the 
screw dislocation glides along two faces ABCD and 
BB'C'C of the band, and during the second half-cycle 
returns along the faces B'C'A'D and A'D'DA. Unlike 
the Cottrell mechanism this process can be operated 
with a single slip direction, provided cross-slip can 
occur. 

Neither mechanism can fully explain all the 
experimental observations. The interacting slip mech- 
anism predicts the occurrence of intrusions and extru- 
sions with comparable frequency but not, as is often 
found, in the same slip band. With the cross-slip mech- 
anism, there is no experimental evidence to show that 
cavities exist beneath the material being extruded. It 
may well be that different mechanisms operate under 
different conditions. 

In a poly crystalline aggregate the operation of 
several slip modes is necessary and intersecting slip 
unavoidable. Accordingly, the widely differing fatigue 
behaviour of metals may be accounted for by the 
relative ease with which cross-slip occurs. Thus, 
those factors which affect the onset of stage III in 
the work-hardening curve will also be important in 
fatigue, and conditions suppressing cross-slip would, 
in general, increase the resistance to fatigue failure, 
i.e. low stacking-fault energy and low temperatures. 
Aluminium would be expected to have poor fatigue 
properties on this basis but the unfavourable fatigue 
characteristics of the high-strength aluminium alloys 



Figure 7.75 Formation of an extrusion and associated 
cavity by the Mott mechanism. 


is probably also due to the unstable nature of the alloy 
and to the influence of vacancies. 

In pure metals and alloys, transgranular cracks ini- 
tiate at intrusions in PSBs or at sites of surface rough- 
ness associated with emerging planar slip bands in 
low SFE alloys. Often the microcrack forms at the 
PSB -matrix interface where the stress concentration is 
high. In commercial alloys containing inclusions or 
second-phase particles, the fatigue behaviour depends 
on the particle size. Small particles ~0. 1 pm can have 
beneficial effects by homogenizing the slip pattern 
and delaying fatigue-crack nucleation. Larger parti- 
cles reduce the fatigue life by both facilitating crack 
nucleation by slip band/particle interaction and increas- 
ing crack growth rates by interface decohesion and 
voiding within the plastic zone at the crack tip. The 
formation of voids at particles on grain boundaries 
can lead to intergranular separation and crack growth. 
The preferential deformation of ‘soft’ precipitate-free 
zones (PFZs) associated with grain boundaries in 
age-hardened alloys also provides a mechanism of 
intergranular fatigue-crack initiation and growth. To 
improve the fatigue behaviour it is therefore necessary 
to avoid PFZs and obtain a homogeneous deforma- 
tion structure and uniform precipitate distribution by 
heat-treatment; localized deformation in PFZs can be 
restricted by a reduction in grain size. 

From the general appearance of a typical fatigue 
fracture, shown in Figure 7.76, one can distinguish two 
distinct regions. The first is a relatively smooth area, 
through which the fatigue crack has spread slowly. 
This area usually has concentric marks about the point 
of origin of the crack which correspond to the positions 
at which the crack was stationary for some period. 
The remainder of the fracture surface shows a typi- 
cally rough transcrystalline fracture where the failure 
has been catastrophic. Electron micrographs of the rel- 
atively smooth area show that this surface is covered 
with more or less regular contours perpendicular to the 
direction of the propagation front. These fatigue stri- 
ations represent the successive positions of the propa- 
gation front and are spaced further apart the higher the 



Figure 7.76 A schematic fatigue fracture. 
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Figure 7.77 Schematic illustration of the formation of fatigue striations. 


velocity of propagation. They are rather uninfluenced 
by grain boundaries and in metals where cross-slip is 
easy (e.g. mild steel or aluminium) may be wavy in 
appearance. Generally, the lower the ductility of the 
material, the less well defined are the striations. 

Stage II growth is rate-controlling in the fatigue fail- 
ure of most engineering components, and is governed 
by the stress intensity at the tip of the advancing crack. 
The striations seen on the fracture surface may form by 
a process of plastic blunting at the tip of the crack, as 
shown in Figure 7.77. In (a) the crack under the ten- 
sile loading part of the cycle generates shear stresses 
at the tip. With increasing tensile load the crack opens 
up and new surface is created (b), separation occurs 
in the slip band and ‘ears’ are formed at the end of 
the crack. The plastic deformation causes the crack to 
be both extended and blunted (c). On the compressive 
part of the cycle the crack begins to close (d), the shear 
stresses are reversed and with increasing load the crack 
almost closes (e). In this part of the cycle the new sur- 
face folds and the ears correspond to the new striations 
on the final fracture surface. A one-to-one correlation 
therefore exists between the striations and the opening 
and closing with ear formation. Crack growth contin- 
ues in this manner until it is long enough to cause the 
final instability when either brittle or ductile (due to 
the reduced cross-section not being able to carry the 
load) failure occurs. In engineering alloys, rather than 
pure metals, which contain inclusions or second-phase 
particles, cracking or voiding occurs ahead of the orig- 
inal crack tip rather than in the ears when the tensile 
stress or strain reaches a critical value. This macro- 
scopic stage of fracture is clearly of importance to 
engineers in predicting the working life of a component 
and has been successfully treated by the application of 
fracture mechanics as discussed in Chapter 8. 

7.11.5 Fatigue at elevated temperatures 

At ambient temperature the fatigue process involves 
intracrystalline slip and surface initiation of cracks, 


followed by transcrystalline propagation. When 
fatigued at elevated temperatures 0.57V, pure metals 
and solid solutions show the formation of discrete 
cavities on grain boundaries, which grow, link up 
and finally produce failure. It is probable that 
vacancies produced by intracrystalline slip give rise 
to a supersaturation which causes the vacancies to 
condense on those grain boundaries that are under a 
high shear stress where the cavities can be nucleated 
by a sliding or ratchet mechanism. It is considered 
unlikely that grain boundary sliding contributes to 
cavity growth, increasing the grain size decreases the 
cavity growth because of the change in boundary area. 
Magnox (Mg) and alloys used in nuclear reactors up to 
0.75T m readily form cavities, but the high-temperature 
nickel-base alloys do not show intergranular cavity 
formation during fatigue at temperatures within their 
normal service range, because intracrystalline slip 
is inhibited by / precipitates. Above about 0.77V, 
however, the y' precipitates coarsen or dissolve and 
fatigue then produce cavities and eventually cavity 
failure. 
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Chapter 8 

Strengthening and toughening 


8.1 Introduction 

The production of materials which possess consider- 
able strength at both room and elevated temperatures 
is of great practical importance. We have already seen 
how alloying, solute-dislocation interaction, grain size 
control and cold- working can give rise to an increased 
yield stress. Of these methods, refining the grain size is 
of universal application to materials in which the yield 
stress has a significant dependence upon grain size. 
In certain alloy systems, it is possible to produce an 
additional increase in strength and hardness by heat- 
treatment alone. Such a method has many advantages, 
since the required strength can be induced at the most 
convenient stage of production or fabrication; more- 
over, the component is not sent into service in a highly 
stressed, plastically deformed state. The basic require- 
ment for such a special alloy is that it should undergo 
a phase transformation in the solid state. One type of 
alloy satisfying this requirement, already considered, 
is that which can undergo an order- disorder reac- 
tion; the hardening accompanying this process (similar 
in many ways to precipitation-hardening) is termed 
order-hardening. However, conditions for this form of 
hardening are quite stringent, so that the two prin- 
cipal hardening methods, commonly used for alloys, 
are based upon (1) precipitation from a supersaturated 
solid solution and (2) eutectoid decomposition. 

In engineering applications, strength is, without 
doubt, an important parameter. However, it is by no 
means the only important one and usually a material 
must provide a combination of properties. Some duc- 
tility is generally essential, enabling the material to 
relieve stress concentrations by plastic deformation and 
to resist fracture. The ability of materials to resist crack 
propagation and fracture, known generally as tough- 
ness, will be discussed in this chapter. Fracture can 
take many forms; some special forms, such as brit- 
tle fracture by cleavage, ductile fracture by microvoid 
coalescence, creep fracture by triple-point cracking and 
fatigue cracking, will be examined. 


This chapter primarily concerns alloy behaviour, 
partly because of the inherent versatility of alloy sys- 
tems and partly because the research background to 
much of the current understanding of strength, tough- 
ness and fracture is essentially metallurgical. However, 
it is often possible to extend the basic principles to non- 
metallic materials, particularly in the case of fracture 
processes. This will be apparent later, in Chapter 10, 
when we describe how the unique transformation char- 
acteristics of zirconia can be used to inhibit crack 
propagation in a brittle ceramic such as alumina. Meth- 
ods for toughening glasses are described in the same 
chapter. In Chapter 1 1 we consider the strengthening 
and toughening effects produced when plastics, met- 
als and ceramics are reinforced with filaments to form 
composite materials. 

8.2 Strengthening of non-ferrous 
alloys by heat-treatment 

8.2.1 Precipitation-hardening of Al-Cu alloys 

8. 2. 1.1 Precipitation from supersaturated solid 
solution 

The basic requirements of a precipitation-hardening 
alloy system is that the solid solubility limit should 
decrease with decreasing temperature as shown 
in Figure 8.1 for the Al-Cu system. During the 
precipitation-hardening heat-treatment procedure the 
alloy is first solution heat-treated at the high 
temperature and then rapidly cooled by quenching 
into water or some other cooling medium. The rapid 
cooling suppresses the separation of the 0-phase 
so that the alloy exists at the low temperature in 
an unstable supersaturated state. If, however, after 
quenching, the alloy is allowed to ‘age’ for a sufficient 
length of time, the second phase precipitates out. 
This precipitation occurs by a nucleation and growth 
process, fluctuations in solute concentration providing 
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small clusters of atoms in the lattice which act as nuclei 
for the precipitate. However, the size of the precipitate 
becomes finer as the temperature at which precipitation 
occurs is lowered, and extensive hardening of the 
alloy is associated with a critical dispersion of the 
precipitate. If, at any given temperature, ageing is 
allowed to proceed too far, coarsening of the particles 
occurs (i.e. the small ones tend to redissolve, and 
the large ones to grow still larger as discussed in 
Section 8.2.6) and the numerous finely dispersed, 
small particles are gradually replaced by a smaller 
number of more widely dispersed, coarser particles. 
In this state the alloy becomes softer, and it is then 
said to be in the over- aged condition (see Figure 8.2). 

8. 2. 1.2 Changes in properties accompanying 
precipitation 

The actual quenching treatment gives rise to small 
changes in many of the mechanical and physical prop- 
erties of alloys because both solute atoms and point 
defects in excess of the equilibrium concentration are 
retained during the process, and because the quench 
itself often produces lattice strains. Perhaps the prop- 
erty most markedly affected is the electrical resistance 
and this is usually considerably increased. In con- 
trast, the mechanical properties are affected relatively 
much less. 

On ageing, the change in properties in a 
quenched material is more marked and, in particular, 
the mechanical properties often show striking 
modifications. For example, the tensile strength of 
Duralumin (i.e. an aluminium-4% copper alloy 
containing magnesium, silicon and manganese) may 
be raised from 0.21 to 0.41 GN/m 2 while that of 
a Cu-2Be alloy may be increased from 0.46 to 



wt % Cu 

Figure 8.1 Al-rich Al-Cu binary diagram showing GP [ 1 ], 
0" and 0 r solvus lines (dotted). 


1.23 GN/m 2 . The structure-sensitive properties such as 
hardness, yield stress, etc. are, of course, extremely 
dependent on the structural distribution of the 
phases and, consequently, such alloys usually exhibit 
softening as the finely dispersed precipitates coarsen. 

A simple theory of precipitation, involving the 
nucleation and growth of particles of the expected 
new equilibrium phase, leads one to anticipate that 
the alloy would show a single hardening peak, the 
electrical resistivity a decrease, and the lattice param- 
eter an increase (assuming the solute atom is smaller 
than the solvent atom) as the solute is removed from 
solution. Such property changes are found in practice, 
but only at low supersaturations and high ageing tem- 
peratures. At higher supersaturations and lower ageing 
temperatures the various property changes are not con- 
sistent with such a simple picture of precipitation; the 
alloy may show two or more age-hardening peaks, and 
the electrical resistivity and lattice parameter may not 
change in the anticipated manner. A hardening pro- 
cess which takes place in two stages is shown in alu- 
minium-copper alloys (Figure 8.2a) where the initial 
hardening occurs without any attendant precipitation 
being visible in the light microscope and, moreover, 
is accompanied by a decrease in conductivity and no 
change in lattice parameter. Such behaviour may be 
accounted for if precipitation is a process involving 
more than one stage. The initial stage of precipitation, 
at the lower ageing temperatures, involves a cluster- 
ing of solute atoms on the solvent lattice sites to form 
zones or clusters, coherent with the matrix; the zones 
cannot be seen in the light microscope and for this rea- 
son this stage was at one time termed pre-precipitation. 
At a later stage of the ageing process these clusters 
break away from the matrix lattice to form distinct 
particles with their own crystal structure and a definite 
interface. These hypotheses were confirmed originally 
by structural studies using X-ray diffraction techniques 
but nowadays the so-called pre-precipitation effects 
can be observed directly in the electron microscope. 

Even though clustering occurs, the general kinetic 
behaviour of the precipitation process is in agreement 
with that expected on thermodynamic grounds. From 
Figure 8.2 it is evident that the rate of ageing increases 
markedly with increasing temperature while the peak 
hardness decreases. Two-stage hardening takes place 
at low ageing temperatures and is associated with 
high maximum hardness, while single-stage hardening 
occurs at higher ageing temperatures, or at lower 
ageing temperatures for lower solute contents. 

Another phenomenon commonly observed in 
precipitation-hardening alloys is reversion or retrogres- 
sion. If an alloy hardened by ageing at low temperature 
is subsequently heated to a higher ageing temperature 
it softens temporarily, but becomes harder again on 
more prolonged heating. This temporary softening, or 
reversion of the hardening process, occurs because the 
very small nuclei or zones precipitated at the low tem- 
perature are unstable when raised to the higher ageing 
temperature, and consequently they redissolve and the 
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(a) 



Figure 8.2 The ageing of aluminium- copper alloys at 

(a) 130° C and (b) at 190° C ( after Silcock, Heal and Hardy, 

1953-4). 


alloy becomes softer; the temperature above which the 
nuclei or zones dissolve is known as the solvus tem- 
perature; Figure 8.1 shows the solvus temperatures for 
GP zones, 0 0 ' and 0 . On prolonged ageing at the 
higher temperature larger nuclei, characteristic of that 
temperature, are formed and the alloy again hardens. 
Clearly, the reversion process is reversible, provided 
re-hardening at the higher ageing temperature is not 
allowed to occur. 

8. 2. 1.3 Structural changes during precipitation 
Early metallographic investigations showed that the 
micro structural changes which occur during the initial 
stages of ageing are on too fine a scale to be resolved 
by the light microscope, yet it is in these early stages 
that the most profound changes in properties are found. 
Accordingly, to study the process, it is necessary to 
employ the more sensitive and refined techniques of 
X-ray diffraction and electron microscopy. 

The two basic X-ray techniques, important in study- 
ing the regrouping of atoms during the early stages 
of ageing, depend on the detection of radiation scat- 
tered away from the main diffraction lines or spots 
(see Chapter 5). In the first technique, developed 
independently by Guinier and Preston in 1938, the 
Laue method is used. They found that the single- 
crystal diffraction pattern of an aluminium -copper 


alloy developed streaks extending from an aluminium 
lattice reflection along (100)ai directions. This was 
attributed to the formation of copper-rich regions of 
plate-like shape on {100} planes of the aluminium 
matrix (now called Guinier- Preston zones or GP 
zones). The net effect of the regrouping is to mod- 
ify the scattering power of, and spacing between, very 
small groups of {100} planes throughout the crystal. 
However, being only a few atomic planes thick, the 
zones produce the diffraction effect typical of a two- 
dimensional lattice, i.e. the diffraction spot becomes a 
diffraction streak. In recent years the Laue method has 
been replaced by a single-crystal oscillation technique 
employing monochromatic radiation, since interpreta- 
tion is made easier if the wavelength of the X-rays used 
is known. The second technique makes use of the phe- 
nomenon of scattering of X-rays at small angles (see 
Chapter 5). Intense small-angle scattering can often 
be observed from age-hardening alloys (as shown in 
Figures 8.3 and 8.5) because there is usually a differ- 
ence in electron density between the precipitated zone 
and the surrounding matrix. However, in alloys such 
as aluminium -magnesium or aluminium- silicon the 
technique is of no value because in these alloys the 
small difference in scattering power between the alu- 
minium and silicon or magnesium atoms, respectively, 
is insufficient to give rise to appreciable scattering at 
small angles. 

With the advent of the electron microscope the age- 
ing of aluminium alloys was one of the first subjects to 
be investigated with the thin-foil transmission method. 
Not only can the detailed structural changes which 
occur during the ageing process be followed, but elec- 
tron diffraction pictures taken from selected areas of 
the specimen while it is still in the microscope enable 
further important information on the structure of the 
precipitated phase to be obtained. Moreover, under 
some conditions the interaction of moving dislocations 
and precipitates can be observed. This naturally leads 
to a more complete understanding of the hardening 
mechanism. 

Both the X-ray and electron-microscope techniques 
show that in virtually all age-hardening systems the 
initial precipitate is not the same structure as the equi- 
librium phase. Instead, an ageing sequence: zones -> 
intermediate precipitates -> equilibrium precipitate is 
followed. This sequence occurs because the equilib- 
rium precipitate is incoherent with the matrix, whereas 
the transition structures are either fully coherent, as in 
the case of zones, or at least partially coherent. Then, 
because of the importance of the surface energy and 
strain energy of the precipitate to the precipitation pro- 
cess, the system follows such a sequence in order to 
have the lowest free energy in all stages of precipita- 
tion. The surface energy of the precipitates dominates 
the process of nucleation when the interfacial energy is 
large (i.e. when there is a discontinuity in atomic struc- 
ture, somewhat like a grain boundary, at the interface 
between the nucleus and the matrix), so that for the 
incoherent type of precipitate the nuclei must exceed a 
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Figure 8.3 (a) Small-angle X-ray pattern from aluminium-4% copper single crystal taken with molybdenum Ka radiation at a 
sample to film distance of 4 cm ( after Guinier and Fournet, 1955; courtesy of John Wiley and Sons), (b) Electron micrograph 
of aluminium-4% copper aged 16 hours at 130° C, showing GP [1] zones (after Nicholson, Thomas and Nutting, 1958-9). 


certain minimum size before they can nucleate a new 
phase. To avoid such a slow mode of precipitation 
a coherent type of precipitate is formed instead, for 
which the size effect is relatively unimportant. The 
condition for coherence usually requires the precipi- 
tate to strain its equilibrium lattice to fit that of the 
matrix, or to adopt a metastable lattice. However, in 
spite of both a higher volume free energy and a higher 
strain energy, the transition structure is more stable in 
the early stages of precipitation because of its lower 
interfacial energy. 

When the precipitate does become incoherent the 
alloy will, nevertheless, tend to reduce its surface 
energy as much as possible, by arranging the orienta- 
tion relationship between the matrix and the precipitate 
so that the crystal planes which are parallel to, and sep- 
arated by, the bounding surface have similar atomic 
spacings. Clearly, for these habit planes, as they are 
called, the better the crystallographic match, the less 
will be the distortion at the interface and the lower 
the surface energy. This principle governs the precip- 
itation of many alloy phases, as shown by the fre- 
quent occurrence of the Widmanstatten structure, i.e. 
plate-shaped precipitates lying along prominent crys- 
tallographic planes of the matrix. Most precipitates are 
plate- shaped because the strain energy factor is least 
for this form. 

The existence of a precipitation sequence is reflected 
in the ageing curves and, as we have seen in 
Figure 8.2, often leads to two stages of hardening. 
The zones, by definition, are coherent with the 
matrix, and as they form the alloy becomes harder. 
The intermediate precipitate may be coherent with 
the matrix, in which case a further increase of 
hardness occurs, or only partially coherent, when either 
hardening or softening may result. The equilibrium 
precipitate is incoherent and its formation always leads 
to softening. These features are best illustrated by a 
consideration of some actual age-hardening systems. 


Precipitation reactions occur in a wide variety 
of alloy systems as shown in Table 8.1. The 
aluminium- copper alloy system exhibits the greatest 
number of intermediate stages in its precipitation 
process, and consequently is probably the most 
widely studied. When the copper content is high and 
the ageing temperature low, the sequence of stages 
followed is GP [1], GP [2], O' and 0 (CuA 1 2 ). On 
ageing at higher temperatures, however, one or more 
of these intermediate stages may be omitted and, 
as shown in Figure 8.2, corresponding differences 
in the hardness curves can be detected. The early 
stages of ageing are due to GP [1] zones, which 
are interpreted as plate-like clusters of copper atoms 
segregated onto {100} planes of the aluminium matrix. 
A typical small-angle X-ray scattering pattern and 
thin-foil transmission electron micrograph from GP [1] 
zones are shown in Figure 8.3. The plates are only 
a few atomic planes thick (giving rise to the (100) 
streaks in the X-ray pattern), but are about 10 nm long, 
and hence appear as bright or dark lines on the electron 
micrograph. 

GP [2] is best described as a coherent intermediate 
precipitate rather than a zone, since it has a defi- 
nite crystal structure; for this reason the symbol 6" 
is often preferred. These precipitates, usually of max- 
imum thickness 10 nm and up to 150 nm diameter, 
have a tetragonal structure which fits perfectly with 
the aluminium unit cell in the a and b directions but 
not in the c. The structure postulated has a central 
plane which consists of 100% copper atoms, the next 
two planes a mixture of copper and aluminium and 
the other two basal planes of pure aluminium, giv- 
ing an overall composition of CuA 1 2 . Because of their 
size, 0" precipitates are easily observed in the elec- 
tron microscope, and because of the ordered arrange- 
ments of copper and aluminium atoms within the 
structure, their presence gives rise to intensity max- 
ima on the diffraction streaks in an X-ray photograph. 
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Table 8.1 Some common precipitation-hardening systems 


Base 

metal 

Solute 

Transition structure 

Equilibrium 

precipitate 

A1 

Cu 

(i) Plate-like solute rich GP [1] zones on 
(10 0 }ai ; (h) ordered zones of GP [2]; 

(iii) O '- -phase (plates). 

0- CuAl 2 


Ag 

(i) Spherical solute-rich zones; (ii) platelets 
of hexagonal y' on {1 1 1 }ai- 

K-Ag 2 Al 


Mg, Si 

(i) GP zones rich in Mg and Si atoms on 
{10 0 }ai planes; (ii) ordered zones of ft. 

/TMg 2 Si 

(plates) 


Mg, Cu 

(i) GP zones rich in Mg and Cu atoms on 
(10 0 }ai planes; (ii) S' platelets on 
{02 1 }ai planes. 

S-Al 2 CuMg 

(laths) 


Mg, Zn 

(i) Spherical zones rich in Mg and Zn; (ii) platelets 
of rj' phase on {1 1 1 }ai* 

? 7 -MgZn 2 

(plates) 

Cu 

Be 

(i) B e-rich regions on {100}c u planes; (ii) y' . 

y-CuBe 


Co 

Spherical GP zones. 

ft - Co plates 

Fe 

C 

(i) Martensite ( a '); (ii) martensite (a"); 

(iii) e-carbide. 

Fe 3 C plates 
cementite 


N 

(i) Nitrogen martensite (a'); (ii) martensite 
(a") discs. 

Fe 4 N 

Ni 

Al, Ti 

y' cubes 

y-Ni 3 (AlTi) 


Since the c parameter 0.78 nm differs from that of 
aluminium 0.404 nm the aluminium planes parallel to 
the plate are distorted by elastic coherency strains. 
Moreover, the precipitate grows with the c direction 
normal to the plane of the plate, so that the strain 
fields become larger as it grows and at peak hard- 
ness extend from one precipitate particle to the next 
(see Figure 8.4a). The direct observation of coherency 
strains confirms the theories of hardening based on the 
development of an elastically strained matrix (see next 
section). 

The transition structure 9' is tetragonal; the true 
unit cell dimensions are a = 0.404 and c = 0.58 nm 
and the axes are parallel to (10 0) A i directions. The 
strains around the 9' plates can be relieved, however, 
by the formation of a stable dislocation loop around 
the precipitate and such a loop has been observed 
around small 6' plates in the electron microscope as 
shown in Figure 8.4b. The long-range strain fields 
of the precipitate and its dislocation largely cancel. 
Consequently, it is easier for glide dislocations to move 
through the lattice of the alloy containing an incoherent 
precipitate such as 6' than a coherent precipitate such 
as 0", and the hardness falls. 

The 0 structure is also tetragonal, with a = 0.606 
and c = 0.487 nm. This equilibrium precipitate is 
incoherent with the matrix and its formation always 
leads to softening, since coherency strains disap- 
pear. 

8.2.2 Precipitation-hardening of Al-Ag alloys 

Investigations using X-ray diffraction and electron 
microscopy have shown the existence of three dis- 
tinct stages in the age-hardening process, which may 
be summarized: silver-rich clusters -> intermediate 


hexagonal y' -> equilibrium hexagonal y. The hard- 
ening is associated with the first two stages in which 
the precipitate is coherent and partially coherent with 
the matrix, respectively. 

During the quench and in the early stages of ageing, 
silver atoms cluster into small spherical aggregates and 
a typical small-angle X-ray picture of this stage, shown 
in Figure 8.5a, has a diffuse ring surrounding the trace 
of the direct beam. The absence of intensity in the 
centre of the ring (i.e. at (0 00)) is attributed to the 
fact that clustering takes place so rapidly that there is 
left a shell-like region surrounding each cluster which 
is low in silver content. On ageing, the clusters grow in 
size and decrease in number, and this is characterized 
by the X-ray pattern showing a gradual decrease in ring 
diameter. The concentration and size of clusters can be 
followed very accurately by measuring the intensity 
distribution across the ring as a function of ageing 
time. This intensity may be represented (see Chapter 5) 
by an equation of the form 

l(s) = Mn 2 [exp (—2n 2 R 2 e 2 /3X 2 ) 

- exp (-2n 2 R]s 2 /3X 2 )] 2 (8.1) 

and for values of e greater than that corresponding 
to the maximum intensity, the contribution of the 
second term, which represents the denuded region 
surrounding the cluster, can be neglected. Figure 8.5b 
shows the variation in the X-ray intensity, scattered at 
small angles (SAS) with cluster growth, on ageing an 
aluminium -silver alloy at 120°C. An analysis of this 
intensity distribution, using equation (8.1), indicates 
that the size of the zones increases from 2 to 5 nm in 
just a few hours at 120°C. These zones may, of course, 
be seen in the electron microscope and Figure 8.6a 
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(c) 


Figure 8.4 Electron micrographs from Al-4Cu (a) aged 
5 hours at 160° C showing 9" plates, (b) aged 12 hours at 
200° C showing a dislocation ring round 0" plates, (c) aged 
3 days at 160° C showing 9" precipitated on helical 
dislocations (after Nicholson, Thomas and Nutting, 1958-9). 

is an electron micrograph showing spherical zones 
in an aluminium -silver alloy aged 5 hours at 160°C; 
the diameter of the zones is about 10 nm in good 
agreement with that deduced by X-ray analysis. The 
zone shape is dependent upon the relative diameters 
of solute and solvent atoms. Thus, solute atoms such 
as silver and zinc which have atomic sizes similar to 
aluminium give rise to spherical zones, whereas solute 
atoms such as copper which have a high misfit in the 
solvent lattice form plate-like zones. 

With prolonged annealing, the formation and growth 
of platelets of a new phase, /, occur. This is charac- 
terized by the appearance in the X-ray pattern of short 
streaks passing through the trace of the direct beam 
(Figure 8.5c). The y' platelet lies parallel to the {1 1 1} 
planes of the matrix and its structure has lattice param- 
eters very close to that of aluminium. However, the 



Figure 8.5 Small-angle scattering of Cu Ka radiation by 
polycrystalline Al-Ag. (a) After quenching from 520° C 
(after Guinier and Walker, 1953). (b) The change in ring 
intensity and ring radius on ageing at 120° C (after 
Smallman and Westmacott, unpublished), (c) After ageing at 
140° C for 10 days (after Guinier and Walker, 1953). 


structure is hexagonal and, consequently, the precipi- 
tates are easily recognizable in the electron microscope 
by the stacking fault contrast within them, as shown in 
Figure 8.6b. Clearly, these precipitates are never fully 
coherent with the matrix, but, nevertheless, in this alloy 
system, where the zones are spherical and have little or 
no coherency strain associated with them, and where 
no coherent intermediate precipitate is formed, the par- 
tially coherent y' precipitates do provide a greater 
resistance to dislocation movement than zones and a 
second stage of hardening results. 

The same principles apply to the constitution- 
ally more complex ternary and quaternary alloys 
as to the binary alloys. Spherical zones are found 
in aluminium-magnesium-zinc alloys as in alu- 
minium-zinc, although the magnesium atom is some 
12% larger than the aluminium atom. The intermedi- 
ate precipitate forms on the {11 l} A i planes, and is 
partially coherent with the matrix with little or no 
strain field associated with it. Hence, the strength of 
the alloy is due purely to dispersion hardening, and 
the alloy softens as the precipitate becomes coarser. 
In nickel-based alloys the hardening phase is the 
ordered ]/-Ni 3 Al; this y' is an equilibrium phase in 
the Ni-Al and Ni-Cr-Al systems and a metastable 
phase in Ni-Ti and Ni-Cr-Ti. These systems form 
the basis of the 4 superalloys ’ (see Chapter 9) which 
owe their properties to the close matching of the y' 
and the fee matrix. The two phases have very simi- 
lar lattice parameters ((<0.25%), depending on com- 
position) and the coherency (interfacial energy y\ ~ 
10-20 mJ/m 2 ) confers a very low coarsening rate on 
the precipitate so that the alloy overages extremely 
slowly even at 0.7T m . 
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Figure 8.6 Electron micrographs from Al-Ag alloy (a) aged 
5 hours at 160° C showing spherical zones, and (b) aged 
5 days at 160° C showing y' precipitate ( after Nicholson, 
Thomas and Nutting, 1958-9). 

8.2.3 Mechanisms of precipitation-hardening 

8.2.3. 1 The significance of particle 
deformability 

The strength of an age-hardening alloy is governed by 
the interaction of moving dislocations and precipitates. 
The obstacles in precipitation-hardening alloys which 
hinder the motion of dislocations may be either (1) the 
strains around GP zones, (2) the zones or precipitates 
themselves, or both. Clearly, if it is the zones them- 
selves which are important, it will be necessary for 
the moving dislocations either to cut through them or 
go round them. Thus, merely from elementary reason- 
ing, it would appear that there are at least three causes 
of hardening, namely: (1) coherency strain hardening, 
(2) chemical hardening, i.e. when the dislocation cuts 
through the precipitate, or (3) dispersion hardening, i.e. 
when the dislocation goes round or over the precipitate. 

The relative contributions will depend on the 
particular alloy system but, generally, there is a critical 
dispersion at which the strengthening is a maximum, as 
shown in Figure 8.7. In the small-particle regime the 
precipitates, or particles, are coherent and deformable 
as the dislocations cut through them, while in the 
larger-particle regime the particles are incoherent 
and non-deformable as the dislocations bypass them. 
For deformable particles, when the dislocations pass 
through the particle, the intrinsic properties of the 
particle are of importance and alloy strength varies 
only weakly with particle size. For non-deformable 


particles, when the dislocations bypass the particles, 
the alloy strength is independent of the particle 
properties but is strongly dependent on particle size 
and dispersion strength decreasing as particle size or 
dispersion increases. The transition from deformable 
to non-deformable particle-controlled deformation is 
readily recognized by the change in microstructure, 
since the ‘laminar’ undisturbed dislocation flow for the 
former contrasts with the turbulent plastic flow for non- 
deformable particles. The latter leads to the production 
of a high density of dislocation loops, dipoles and other 
debris which results in a high rate of work-hardening. 
This high rate of work-hardening is a distinguishing 
feature of all dispersion-hardened systems. 

8. 2. 3. 2 Coherency strain-hardening 
The precipitation of particles having a slight misfit in 
the matrix gives rise to stress fields which hinder the 
movement of gliding dislocations. For the dislocations 
to pass through the regions of internal stress the applied 
stress must be at least equal to the average internal 
stress, and for spherical particles this is given by 

r = 2 ptef (8.2) 

where pi is the shear modulus, s is the misfit of the 
particle and / is the volume fraction of precipitate. 
This suggestion alone, however, cannot account for 
the critical size of dispersion of a precipitate at which 
the hardening is a maximum, since equation (8.2) is 
independent of L, the distance between particles. To 
explain this, Mott and Nabarro consider the extent to 
which a dislocation can bow round a particle under 
the action of a stress r. Like the bowing stress of a 
Frank-Read source this is given by 

r = aptb/z (8.3) 

where r is the radius of curvature to which the dislo- 
cation is bent which is related to the particle spacing. 
Hence, in the hardest age-hardened alloys where the 



Figure 8.7 Variation of strength with particle size, defining 
the deformable and non-deformable particle regimes. 
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yield strength is about /x/ 100, the dislocation can bend 
to a radius of curvature of about 100 atomic spac- 
ings, and since the distance between particles is of the 
same order it would appear that the dislocation can 
avoid the obstacles and take a form like that shown in 
Figure 8.8a. With a dislocation line taking up such a 
configuration, in order to produce glide, each section 
of the dislocation line has to be taken over the adverse 
region of internal stress without any help from other 
sections of the line — the alloy is then hard. If the 
precipitate is dispersed on too fine a scale (e.g. when 
the alloy has been freshly quenched or lightly aged) 
the dislocation is unable or bend sufficiently to lie 
entirely in the regions of low internal stress. As a 
result, the internal stresses acting on the dislocation 
line largely cancel and the force resisting its move- 
ment is small — the alloy then appears soft. When 
the dispersion is on a coarse scale, the dislocation line 
is able to move between the particles, as shown in 
Figure 8.8b, and the hardening is again small. 

For coherency strain hardening the flow stress 
depends on the ability of the dislocation to bend and 
thus experience more regions of adverse stress than of 
aiding stress. The flow stress therefore depends on the 
treatment of averaging the stress, and recent attempts 
separate the behaviour of small and large coherent par- 
ticles. For small coherent particles the flow stress is 
given by 

T = AA l is il2 p l2 {r/by 12 (8.4) 

which predicts a greater strengthening than the sim- 
ple arithmetic average of equation (8.2). For large 
coherent particles 

r = 0.7 nf l/2 (eb 3 /r 3 ) 11 * (8.5) 

8.2.3. 3 Chemical hardening 

When a dislocation actually passes through a zone 
as shown in Figure 8.9 a change in the number of 
solvent -solute near-neighbours occurs across the slip 
plane. This tends to reverse the process of cluster- 
ing and, hence, additional work must be done by the 
applied stress to bring this about. This process, known 
as chemical hardening, provides a short-range interac- 
tion between dislocations and precipitates and arises 
from three possible causes: (1) the energy required 
to create an additional particle/matrix interface with 
energy y\ P er unit area which is provided by a stress 

r ~ a Y l'\fr) l ' 2 /^b 2 (8.6) 

where o' is a numerical constant, (2) the additional 
work required to create an antiphase boundary inside 
the particle with ordered structure, given by 

T - PYl£(fr) l/2 /nb 2 (8.7) 

where fi is a numerical constant, and (3) the change 
in width of a dissociated dislocation as it passes 


Stress field of 




Figure 8.8 Schematic representation of a dislocation (a) 
curling round the stress fields from precipitates and (b) 
passing between widely spaced precipitates ( Orowan 
looping ). 

through the particle where the stacking fault energy 
differs from the matrix (e.g. Al-Ag where Ay SF ~ 
100 mJ/m 2 between Ag zones and A1 matrix) so that 

r ~ Aysv/b (8.8) 

Usually y\ < y ap b and so y\ can be neglected, but the 
ordering within the particle requires the dislocations to 
glide in pairs. This leads to a strengthening given by 

t = (n t J2b)[A Y ^ b rf /tzT ) 1 ' 2 - /] (8.9) 

where T is the dislocation line tension. 

8. 2. 3. 4 Dispersion-hardening 
In dispersion-hardening it is assumed that the precipi- 
tates do not deform with the matrix and that the yield 
stress is the stress necessary to expand a loop of dislo- 
cation between the precipitates. This will be given by 
the Orowan stress 

r = aptb/L (8.10) 

where L is the separation of the precipitates. As dis- 
cussed above, this process will be important in the later 
stages of precipitation when the precipitate becomes 
incoherent and the misfit strains disappear. A mov- 
ing dislocation is then able to bypass the obstacles, as 
shown in Figure 8.8b, by moving in the clean pieces 
of crystal between the precipitated particles. The yield 
stress decreases as the distance between the obsta- 
cles increases in the over-aged condition. However, 
even when the dispersion of the precipitate is coarse 
a greater applied stress is necessary to force a dislo- 
cation past the obstacles than would be the case if the 
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Figure 8.9 Ordered particle (a) cut by dislocations in (b) to produce new interface and apb. 


obstruction were not there. Some particle or precipitate 
strengthening remains but the majority of the strength- 
ening arises from the dislocation debris left around the 
particles giving rise to high work-hardening. 

8. 2. 3. 5 Hardening mechanisms in Al-Cu alloys 
The actual hardening mechanism which operates in a 
given alloy will depend on several factors, such as 
the type of particle precipitated (e.g. whether zone, 
intermediate precipitate or stable phase), the mag- 
nitude of the strain and the testing temperature. In 
the earlier stages of ageing (i.e. before over-ageing) 
the coherent zones are cut by dislocations moving 
through the matrix and hence both coherency strain 
hardening and chemical hardening will be important, 
e.g. in such alloys as aluminium- copper, copper- 
beryllium and iron-vanadium-carbon. In alloys such 
as aluminium -silver and aluminium -zinc, however, 
the zones possess no strain field, so that chemical 
hardening will be the most important contribution. In 
the important high-temperature creep-resistant nickel 
alloys the precipitate is of the Ni 3 Al form which has 
a low particle/matrix misfit and hence chemical hard- 
ening due to dislocations cutting the particles is again 
predominant. To illustrate that more than one mech- 
anism of hardening is in operation in a given alloy 
system, let us examine the mechanical behaviour of 
an aluminium- copper alloy in more detail. 

Figure 8.10 shows the deformation characteristics 
of single crystals of an aluminium- copper (nominally 


4%) alloy in various structural states. The curves were 
obtained by testing crystals of approximately the same 
orientation, but the stress -strain curves from crystals 
containing GP [1] and GP [2] zones are quite different 
from those for crystals containing 6' or 0 precipitates. 
When the crystals contain either GP [1] or GP [2] 
zones, the stress -strain curves are very similar to those 
of pure aluminium crystals, except that there is a two- 
or threefold increase in the yield stress. In contrast, 
when the crystals contain either 0' or 0 precipitates the 
yield stress is less than for crystals containing zones, 
but the initial rate of work-hardening is extremely 
rapid. In fact, the stress -strain curves bear no simi- 
larity to those of a pure aluminium crystal. It is also 
observed that when 6' or 6 is present as a precipitate, 
deformation does not take place on a single slip sys- 
tem but on several systems; the crystal then deforms, 
more nearly as a polycrystal does and the X-ray pattern 
develops extensive asterism. These factors are consis- 
tent with the high rate of work-hardening observed in 
crystals containing O' or 0 precipitates. 

The separation of the precipitates cutting any slip 
plane can be deduced from both X-ray and electron- 
microscope observations. For the crystals, relating to 
Figure 8.10, containing GP [1] zones this value is 
15 nm and for GP [2] zones it is 25 nm. It then follows 
from equation (8.3) that to avoid these precipitates the 
dislocations would have to bow to a radius of cur- 
vature of about 10 nm. To do this requires a stress 
several times greater than the observed flow stress and, 



Figure 8.10 Stress- strain curves from single crystals of aluminium- 4% copper containing GP [1] zones, GP [2], zones, 
9' -precipitates and 0 -precipitates respectively ( after Fine, Bryne and Kelly). 
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in consequence, it must be assumed that the disloca- 
tions are forced through the zones. Furthermore, if we 
substitute the observed values of the flow stress in the 
relation fib /z = L, it will be evident that the bowing 
mechanism is unlikely to operate unless the particles 
are about 60 nm apart. This is confirmed by electron- 
microscope observations which show that dislocations 
pass through GP zones and coherent precipitates, but 
bypass non-coherent particles. Once a dislocation has 
cut through a zone, however, the path for subsequent 
dislocations on the same slip plane will be easier, 
so that the work-hardening rate of crystals containing 
zones should be low, as shown in Figure 8.10. The 
straight, well-defined slip bands observed on the sur- 
faces of crystals containing GP [1] zones also support 
this interpretation. 

If the zones possess no strain field, as in alu- 
minium-silver or aluminium-zinc alloys, the flow 
stress would be entirely governed by the chemical 
hardening effect. However, the zones in aluminium 
copper alloys do possess strain fields, as shown in 
Figure 8.4, and, consequently, the stresses around a 
zone will also affect the flow stress. Each dislocation 
will be subjected to the stresses due to a zone at a 
small distance from the zone. 

It will be remembered from Chapter 7 that temper- 
ature profoundly affects the flow stress if the barrier 
which the dislocations have to overcome is of a short- 
range nature. For this reason, the flow stress of crystals 
containing GP [1] zones will have a larger dependence 
on temperature than that of those containing GP [2] 
zones. Thus, while it is generally supposed that the 
strengthening effect of GP [2] zones is greater than 
that of GP [1], and this is true at normal tempera- 
tures (see Figure 8.10), at very low temperatures it 
is probable that GP [1] zones will have the greater 
strengthening effect due to the short-range interactions 
between zones and dislocations. 

The O' and 0 precipitates are incoherent and do not 
deform with the matrix, so that the critical resolved 
shear stress is the stress necessary to expand a loop 
of dislocation between them. This corresponds to the 


over- aged condition and the hardening to dispersion- 
hardening. The separation of the 0 particles is greater 
than that of the O', being somewhat greater than 1 pm 
and the initial flow stress is very low. In both cases, 
however, the subsequent rate of hardening is high 
because, as suggested by Fisher, Hart and Pry, the 
gliding dislocation interacts with the dislocation loops 
in the vicinity of the particles (see Figure 8.8b). The 
stress -strain curves show, however, that the rate of 
work-hardening falls to a low value after a few per 
cent strain, and these authors attribute the maximum 
in the strain-hardening curve to the shearing of the 
particles. This process is not observed in crystals con- 
taining 0 precipitates at room temperature and, con- 
sequently, it seems more likely that the particles will 
be avoided by cross-slip. If this is so, prismatic loops 
of dislocation will be formed at the particles, by the 
mechanism shown in Figure 8.11, and these will give 
approximately the same mean internal stress as that 
calculated by Fisher, Hart and Pry, but a reduced stress 
on the particle. The maximum in the work-hardening 
curve would then correspond to the stress necessary to 
expand these loops; this stress will be of the order of 
pb/r where r is the radius of the loop which is some- 
what greater than the particle size. At low temperatures 
cross-slip is difficult and the stress may be relieved 
either by initiating secondary slip or by fracture. 

8.2.4 Vacancies and precipitation 

It is clear that because precipitation is controlled by the 
rate of atomic migration in the alloy, temperature will 
have a pronounced effect on the process. Moreover, 
since precipitation is a thermally activated process, 
other variables such as time of annealing, composition, 
grain size and prior cold work are also important. 
However, the basic treatment of age-hardening alloys 
is solution treatment followed by quenching, and the 
introduction of vacancies by the latter process must 
play an important role in the kinetic behaviour. 

It has been recognized that near room temperature, 
zone formation in alloys such as aluminium- copper 
and aluminium- silver occurs at a rate many orders 
of magnitude greater than that calculated from the 


(a) 





Figure 8.11 Cross-slip of (a) edge and (b) screw dislocation over a particle producing prismatic loops in the process. 
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diffusion coefficient of the solute atoms. In alu- 
minium-copper, for example, the formation of zones 
is already apparent after only a few minutes at room 
temperature, and is complete after an hour or two, 
so that the copper atoms must therefore have moved 
through several atomic spacings in that time. This cor- 
responds to an apparent diffusion coefficient of copper 
in aluminium of about 10 — 20 — 10 — 22 m 2 s _1 , which is 
many orders of magnitude faster than the value of 
5 x 10 -29 m 2 s -1 obtained by extrapolation of high- 
temperature data. Many workers have attributed this 
enhanced diffusion to the excess vacancies retained 
during the quenching treatment. Thus, since the expres- 
sion for the diffusion coefficient at a given temperature 
contains a factor proportional to the concentration of 
vacancies at that temperature, if the sample contains an 
abnormally large vacancy concentration then the diffu- 
sion coefficient should be increased by the ratio cq/c 0 , 
where cq is the quenched-in vacancy concentration and 
c 0 is the equilibrium concentration. The observed clus- 
tering rate can be accounted for if the concentration of 
vacancies retained is about 10 _3 -10 -4 . 

The observation of loops by transmission electron 
microscopy allows an estimate of the number of 
excess vacancies to be made, and in all cases of 
rapid quenching the vacancy concentration in these 
alloys is somewhat greater than 10 -4 , in agreement 
with the predictions outlined above. Clearly, as 
the excess vacancies are removed, the amount of 
enhanced diffusion diminishes, which agrees with the 
observations that the isothermal rate of clustering 
decreases continuously with increasing time. In fact, 
it is observed that D decreases rapidly at first and 
then remains at a value well above the equilibrium 
value for months at room temperature; the process is 
therefore separated into what is called the fast and 
slow reactions. A mechanism proposed to explain the 
slow reaction is that some of the vacancies quenched- 
in are trapped temporarily and then released slowly. 
Measurements show that the activation energy in the 
fast reaction (~0.5 eV) is smaller than in the slow 
reaction (~1 eV) by an amount which can be attributed 
to the binding energy between vacancies and trapping 
sites. These traps are very likely small dislocation 
loops or voids formed by the clustering of vacancies. 
The equilibrium matrix vacancy concentration would 
then be greater than that for a well-annealed crystal by 
a factor exp [y£2/rkT], where y is the surface energy, 
Q the atomic volume and r the radius of the defect 
(see Chapter 4). The experimental diffusion rate can 
be accounted for if r ~ 2 nm, which is much smaller 
than the loops and voids usually seen, but they do exist. 
The activation energy for the slow reaction would then 
be E d — (y£l/r) or approximately 1 eV for r ~ 2 nm. 

Other factors known to affect the kinetics of the 
early stages of ageing (e.g. altering the quenching rate, 
interrupted quenching and cold work) may also be 
rationalized on the basis that these processes lead to 
different concentrations of excess vacancies. In gen- 
eral, cold working the alloy prior to ageing causes 


a decrease in the rate of formation of zones, which 
must mean that the dislocations introduced by cold 
work are more effective as vacancy sinks than as 
vacancy sources. Cold working or rapid quenching 
therefore have opposing effects on the formation of 
zones. Vacancies are also important in other aspects 
of precipitation-hardening. For example, the excess 
vacancies, by condensing to form a high density of 
dislocation loops, can provide nucleation sites for 
intermediate precipitates. This leads to the interest- 
ing observation in aluminium- copper alloys that cold 
working or rapid quenching, by producing dislocations 
for nucleation sites, have the same effect on the for- 
mation of the 8' phase but, as we have seen above, the 
opposite effect on zone formation. It is also interesting 
to note that screw dislocations, which are not normally 
favourable sites for nucleation, can also become sites 
for preferential precipitation when they have climbed 
into helical dislocations by absorbing vacancies, and 
have thus become mainly of edge character. The long 
arrays of 0' phase observed in aluminium- copper 
alloys, shown in Figure 8.4c, have probably formed 
on helices in this way. In some of these alloys, defects 
containing stacking faults are observed, in addition to 
the dislocation loops and helices, and examples have 
been found where such defects nucleate an interme- 
diate precipitate having a hexagonal structure. In alu- 
minium-silver alloys it is also found that the helical 
dislocations introduced by quenching absorb silver and 
degenerate into long narrow stacking faults on {1 1 1} 
planes; these stacking-fault defects then act as nuclei 
for the hexagonal y' precipitate. 

Many commercial alloys depend critically on 
the interrelation between vacancies, dislocations and 
solute atoms and it is found that trace impurities 
significantly modify the precipitation process. Thus 
trace elements which interact strongly with vacancies 
inhibit zone formation, e.g. Cd, In, Sn prevent zone 
formation in slowly quenched Al-Cu alloys for up 
to 200 days at 30° C. This delays the age-hardening 
process at room temperature which gives more time for 
mechanically fabricating the quenched alloy before it 
gets too hard, thus avoiding the need for refrigeration. 
On the other hand, Cd increases the density of 6' 
precipitate by increasing the density of vacancy loops 
and helices which act as nuclei for precipitation and by 
segregating to the matrix-0' interfaces thereby reducing 
the interfacial energy. 

Since grain boundaries absorb vacancies in many 
alloys there is a grain boundary zone relatively free 
from precipitation. The Al-Zn-Mg alloy is one com- 
mercial alloy which suffers grain boundary weakness 
but it is found that trace additions of Ag have a ben- 
eficial effect in refining the precipitate structure and 
removing the precipitate free grain boundary zone. 
Here it appears that Ag atoms stabilize vacancy clus- 
ters near the grain boundary and also increase the 
stability of the GP zone thereby raising the GP zone 
solvus temperature. Similarly, in the ‘Concorde’ alloy, 
RR58 (basically Al-2.5Cu-l.2Mg with additions), Si 
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addition (0.25 Si) modifies the as-quenched dislocation 
distribution inhibiting the nucleation and growth of 
dislocation loops and reducing the diameter of helices. 
The S-precipitate (Al 2 CuMg) is homogeneously nucle- 
ated in the presence of Si rather than heterogeneously 
nucleated at dislocations, and the precipitate grows 
directly from zones, giving rise to improved and more 
uniform properties. 

Apart from speeding up the kinetics of ageing, 
and providing dislocations nucleation sites, vacan- 
cies may play a structural role when they precipi- 
tate cooperatively with solute atoms to facilitate the 
basic atomic arrangements required for transforming 
the parent crystal structure to that of the product 
phase. In essence, the process involves the system- 
atic incorporation of excess vacancies, produced by the 
initial quench or during subsequent dislocation loop 
annealing, in a precipitate zone or plate to change the 
atomic stacking. A simple example of O' formation in 
Al-Cu is shown schematically in Figure 8.12. Ideally, 
the structure of the 0" phase in Al-Cu consists of 
layers of copper on {100} separated by three lay- 
ers of aluminium atoms. If a next-nearest neighbour 
layer of aluminium atoms from the copper layer is 
removed by condensing a vacancy loop, an embryonic 
O' unit cell with A1 in the correct AAA . . . stacking 
sequence is formed (Figure 8.12b). Formation of the 
final CuA1 2 O' fluorite structure requires only shuffling 
half of the copper atoms into the newly created next- 
nearest neighbour space and concurrent relaxation of 
the A1 atoms to the correct O' interplanar distances 
(Figure 8.12c). 

The structural incorporation of vacancies in a pre- 
cipitate is a non-conservative process since atomic 
sites are eliminated. There exist equivalent conserva- 
tive processes in which the new precipitate structure is 
created from the old by the nucleation and expansion 
of partial dislocation loops with predominantly shear 


character. Thus, for example, the BABAB {100} plane 
stacking sequence of the fee structure can be changed 
to BAABA by the propagation of an a/ 2(1 00) shear 
loop in the {10 0} plane, or to BAAAB by the propa- 
gation of a pair of a/2(l 0 0) partials of opposite sign 
on adjacent planes. Again, the AAA stacking resulting 
from the double shear is precisely that required for the 
embryonic formation of the fluorite structure from the 
fee lattice. 

In visualizing the role of lattice defects in the nucle- 
ation and growth of plate-shaped precipitates, a simple 
analogy with Frank and Shockley partial dislocation 
loops is useful. In the formation of a Frank loop, a layer 
of hep material is created from the fee lattice by the 
(non-conservative) condensation of a layer of vacan- 
cies in {1 1 1}. Exactly the same structure is formed by 
the (conservative) expansion of a Shockley partial loop 
on a {1 1 1} plane. In the former case a semi-coherent 
‘precipitate’ is produced bounded by an a/ 3(1 1 1) dis- 
location, and in the latter a coherent one bounded by 
an a 1 6(1 12). Continued growth of precipitate plates 
occurs by either process or a combination of processes. 
Of course, formation of the final precipitate structure 
requires, in addition to these structural rearrangements, 
the long-range diffusion of the correct solute atom con- 
centration to the growing interface. 

The growth of a second-phase particle with a dis- 
parate size or crystal structure relative to the matrix 
is controlled by two overriding principles -the accom- 
modation of the volume and shape change, and the 
optimized use of the available deformation mecha- 
nisms. In general, volumetric transformation strains 
are accommodated by vacancy or interstitial conden- 
sation, or prismatic dislocation loop punching, while 
deviatoric strains are relieved by shear loop prop- 
agation. An example is shown in Figure 8.13. The 
formation of semi-coherent Cu needles in Fe-l%Cu 
is accomplished by the generation of shear loops in 
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Figure 8.12 Schematic diagram showing the transition of 0" to O' in Al-Cu by the vacancy mechanism. Vacancies from 
annealing loops are condensed on a next-nearest Al plane from the copper layer in 0" to form the required AAA Al stacking. 
Formation of the O' fluorite structure then requires only slight redistribution of the copper atom layer and relaxation of the Al 
layer spacings (courtesy of K. H. Westmacott). 
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Figure 8.13 The formation of semicoherent Cu needles in 
Fe-1% Cu ( courtesy of K. H. Westacott). 

the precipitate/matrix interface. Expansion of the loops 
into the matrix and incorporation into nearby precipi- 
tate interfaces leads to a complete network of disloca- 
tions interconnecting the precipitates. 

8.2.5 Duplex ageing 

In non-ferrous heat-treatment there is considerable 
interest in double (or duplex) ageing treatments to 
obtain the best micro structure consistent with opti- 
mum properties. It is now realized that it is unlikely 
that the optimum properties will be produced in alloys 
of the precipitation-hardening type by a single quench 
and ageing treatment. For example, while the interior 
of grains may develop an acceptable precipitate size 
and density, in the neighbourhood of efficient vacancy 
sinks, such as grain boundaries, a precipitate-free zone 
(PFZ) is formed which is often associated with over- 
ageing in the boundary itself. This heterogeneous 
structure gives rise to poor properties, particularly 
under stress corrosion conditions. 

Duplex ageing treatments have been used to over- 
come this difficulty. In Al-Zn-Mg, for example, it 
was found that storage at room temperature before 
heating to the ageing temperature leads to the forma- 
tion of finer precipitate structure and better properties. 
This is just one special example of two-step or multiple 
ageing treatments which have commercial advantages 
and have been found to be applicable to several alloys. 
Duplex ageing gives better competitive mechanical 
properties in Al-alloys (e.g. Al-Zn-Mg alloys) with 
much enhanced corrosion resistance since the grain 
boundary zone is removed. It is possible to obtain 
strengths of 267-308 MN/m 2 in Mg-Zn-Mn alloys 


which have very good strength/weight ratio applica- 
tions, and nickel alloys also develop better properties 
with multiple ageing treatments. 

The basic idea of all heat-treatments is to ‘seed’ 
a uniform distribution of stable nuclei at the low 
temperature which can then be grown to optimum 
size at the higher temperature. In most alloys, there is 
a critical temperature T c above which homogeneous 
nucleation of precipitate does not take place, and 
in some instances has been identified with the GP 
zone solvus. On ageing above T c there is a certain 
critical zone size above which the zones are able to 
act as nuclei for precipitates and below which the 
zones dissolve. 

In general, the ageing behaviour of Al-Zn-Mg 
alloys can be divided into three classes which can be 
defined by the temperature ranges involved: 

1. Alloys quenched and aged above the GP zone 
solvus (i.e. the temperature above which the zones 
dissolve, which is above ~155°C in a typical 
Al-Zn-Mg alloy). Then, since no GP zones are 
ever formed during heat treatment, there are no 
easy nuclei for subsequent precipitation and a very 
coarse dispersion of precipitates results with nucle- 
ation principally on dislocations. 

2. Alloys quenched and aged below the GP zone 
solvus. GP zones form continuously and grow to 
a size at which they are able to transform to pre- 
cipitates. The transformation will occur rather more 
slowly in the grain boundary regions due to the 
lower vacancy concentration there but since age- 
ing will always be below the GP zone solvus, no 
PFZ is formed other than a very small (~30 nm) 
solute-denuded zone due to precipitation in the 
grain boundary. 

3. Alloys quenched below the GP zone solvus and 
aged above it (e.g. quenched to room temperature 
and aged at 180°C for Al-Zn-Mg). This is the most 
common practical situation. The final dispersion of 
precipitates and the PFZ width are controlled by the 
nucleation treatment below 155°C where GP zone 
size distribution is determined. A long nucleation 
treatment gives a fine dispersion of precipitates and 
a narrow PFZ. 

It is possible to stabilize GP zones by addition of 
trace elements. These have the same effect as raising 
T c , so that alloys are effectively aged below T c . One 
example is Ag to Al-Zn-Mg which raises T c from 
155°C to 185°C, another is Si to Al-Cu-Mg, another 
Cu to Al-Mg-Si and yet another Cd or Sn to Al-Cu 
alloys. It is then possible to get uniform distribution 
and optimum properties by single ageing, and is an 
example of achieving by chemistry what can similarly 
be done with physics during multiple ageing. Whether 
it is best to alter the chemistry or to change the physics 
for a given alloy usually depends on other factors (e.g. 
economics). 
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8.2.6 Particle-coarsening 

With continued ageing at a given temperature, there is 
a tendency for the small particles to dissolve and the 
resultant solute to precipitate on larger particles caus- 
ing them to grow, thereby lowering the total interfacial 
energy. This process is termed particle-coarsening, or 
sometimes Ostwald ripening. The driving force for par- 
ticle growth is the difference between the concentration 
of solute ( S r ) in equilibrium with small particles of 
radius r and that in equilibrium with larger particles. 
The variation of solubility with surface curvature is 
given by the Gibbs -Thomson or Thomson -Freundlich 
equation 

ln(S r /S) = 2yQ/kTr (8.11) 

where S is the equilibrium concentration, y the par- 
ticle/matrix interfacial energy and Q the atomic vol- 
ume; since 2yQ <£kTr then S r = S[ 1 + 2yQ/kTr]. 

To estimate the coarsening rate of a particle it is 
necessary to consider the rate-controlling process for 
material transfer. Generally, the rate-limiting factor is 
considered to be diffusion through the matrix and the 
rate of change of particle radius is then derived from 
the equation 

4nr 2 (dr/dt) = D4nR 2 (dS/dR ) 

where dS/dR is the concentration gradient across an 
annulus at a distance R from the particle centre. 
Rewriting the equation after integration gives 

dr/dt = -D(S v -S a )/r (8.12) 

where S a is the average solute concentration a large 
distance from the particle and D is the solute diffusion 
coefficient. When the particle solubility is small, the 
total number of atoms contained in particles may be 
assumed constant, independent of particle size distri- 
bution. Further consideration shows that 

(Sa - S T ) = {2yQS/kTm/f) - (1 /r)\ 

and combining with equation (8.1 1) gives the variation 
of particle growth rate with radius according to 

dr/dt = {2DSy£2/kTr}[(l/r) - (1/r)] (8.13) 

This function is plotted in Figure 8.14, from which 
it is evident that particles of radius less than r are 
dissolving at increasing rates with decreasing values 
of r. All particles of radius greater than r are growing 
but the graph shows a maximum for particles twice 
the mean radius. Over a period of time the number 
of particles decreases discontinuously when particles 
dissolve, and ultimately the system would tend to form 
one large particle. However, before this state is reached 
the mean radius r increases and the growth rate of the 
whole system slows down. 

A more detailed theory than that, due to Green- 
wood, outlined above has been derived by Lifshitz and 
Slyozov, and by Wagner taking into consideration the 



Figure 8.14 The variation of growth rate dr/dt with particle 
radius r for diffusion-controlled growth, for two values of r. 
The value of r for the lower curve is 1.5 times that for the 
upper curve. Particles of radius equal to the mean radius of 
all particles in the system at any instant are neither growing 
nor dissolving. Particles of twice this radius are growing at 
the fastest rate. The smallest particles are dissolving at a 
rate approximately proportional r 2 ( after Greenwood, 1968; 
courtesy of the Institute of Metals). 

initial particle size distribution. They show that the 
mean particle radius varies with time according to 

f - ?l = Kt (8.14) 

where r 0 is the mean particle radius at the onset of 
coarsening and K is a constant given by 

K = %DSyQ>/9kT 

This result is almost identical to that obtained by 
integrating equation (8.13) in the elementary theory 
and assuming that the mean radius is increasing at half 
the rate of that of the fastest-growing particle. 

Coarsening rate equations have also been derived 
assuming that the most difficult step in the process 
is for the atom to enter into solution across the pre- 
cipitate/matrix interface; the growth is then termed 
interface-controlled. The appropriate rate equation is 

dr/dt = -C(S r - S a ) 

and leads to a coarsening equation of the form 

r 2 — Fq = (64CSyttt/8kT) (8.15) 

where C is some interface constant. 

Measurements of coarsening rates so far carried 
out support the analysis basis on diffusion control of 
the particle growth. The most detailed results have 
been obtained for nickel-based systems, particularly 
the coarsening of y'(Ni 3 Al-Ti or Si), which show a 
good r 3 versus t relationship over a wide range of tem- 
peratures. Strains due to coherency and the fact that / 
precipitates are cube- shaped do not seriously affect the 
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Figure 8.15 The variation of r 3 with time of annealing for 
manganese precipitates in a magnesium matrix (after Smith, 
1967; courtesy of Pergamon Press). 


analysis in these systems. Concurrent measurements 
of r and the solute concentration in the matrix dur- 
ing coarsening have enabled values for the interfacial 
energy ~13 mJ/m 2 to be determined. In other sys- 
tems the agreement between theory and experiment is 
generally less precise, although generally the cube of 
the mean particle radius varies linearly with time, as 
shown in Figure 8.15 for the growth of Mn precipitates 
in a Mg-Mn alloy. 

Because of the ease of nucleation, particles may 
tend to concentrate on grain boundaries, and hence 
grain boundaries may play an important part in particle 
growth. For such a case, the Thomson -Freundlich 
equation becomes 

ln(S r /S) = (2 y - y g )Cl/kTx 

where y g is the grain boundary energy per unit area 
and 2x the particle thickness, and their growth follows 
a law of the form 

r) - r\ = Kt (8.16) 

where the constant K includes the solute diffusion 
coefficient in the grain boundary and the boundary 
width. The activation energy for diffusion is lower in 
the grain boundary than in the matrix and this leads 
to a less strong dependence on temperature for the 
growth of grain boundary precipitates. For this reason 
their preferential growth is likely to be predominant 
only at relatively low temperature. 

8.2.7 Spinodal decomposition 

For any alloy composition where the free energy curve 
has a negative curvature, i.e. (d 2 G/dc 2 ) < 0, small 
fluctuations in composition that produce A-rich and 
B-rich regions will bring about a lowering of the total 
free energy. At a given temperature the alloy must lie 
between two points of inflection (where d 2 G/dc 2 = 0) 
and the locus of these points at different temperatures 
is depicted on the phase diagram by the chemical 
spinodal line (see Figure 8.16). 



Figure 8.16 Variation of chemical and coherent spinodal 
with composition. 



(a) 


(b) 




(c) (d) 

Figure 8.17 Composition fluctuations in a spinodal system. 


For an alloy c 0 quenched inside this spinodal, com- 
position fluctuations increase very rapidly with time 
and have a time constant r = — k/4n 2 D , where k is the 
wavelength of composition modulations in one dimen- 
sion and D is the interdiffusion coefficient. For such 
a kinetic process, shown in Figure 8.17, ‘uphill’ diffu- 
sion takes place, i.e. regions richer in solute than the 
average become richer, and poorer become poorer until 
the equilibrium compositions c\ and c 2 of the A-rich 
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and B-rich regions are formed. As for normal precipita- 
tion, interfacial energy and strain energy influency the 
decomposition. During the early stages of decomposi- 
tion the interface between A-rich and B-rich regions 
is diffuse and the interfacial energy becomes a gradi- 
ent energy which depends on the composition gradient 
across the interface according to 

AG int = K(Ac/X) 2 (8.17) 

where k is the wavelength and Ac the amplitude of 
the sinusoidal composition modulation, and K depends 
on the difference in bond energies between like and 
unlike atom pairs. The coherency strain energy term 
is related to the misfit s between regions A and B, 
where e = (l/a)da/dc, the fractional change in lattice 
parameter a per unit composition change, and is given 
for an elastically isotropic solid, by 


A G strain = e 2 Ac 2 EV/(1 - v) 


(8.18) 


with E Young’s modulus, v Poisson’s ratio and V the 
molar volume. The total free energy change arising 
from a composition fluctuation is therefore 


AG = 


d 2 G 2 K 9 

_ + _ + (2e **V/( 1-v) 


Ac 2 / 2 


(8.19) 

and a homogeneous solid solution will decompose 
spinodally provided 


— (d 2 G/dc 2 ) > (2 K/X 2 ) + (2s 2 EV/1 - v) (8.20) 


For X = oo, the condition [(d 2 G/dc 2 ) + (2s 2 EV/\ — 
v)] = 0 is known as the coherent spinodal, as shown 
in Figure 8.16. The k of the composition modulations 
has to satisfy the condition 

X 2 > 2K/[d 2 G/dc 2 + (2e 2 EV/l - v)] (8.21) 


and decreases with increasing degree of undercool- 
ing below the coherent spinodal line. A A-value of 
5-10 nm is favoured, since shorter k 9 s have too sharp 
a concentration gradient and longer k 9 s have too large 
a diffusion distance. For large misfit values, a large 
undercooling is required to overcome the strain energy 
effect. In cubic crystals, E is usually smaller along 
(100) directions and the high strain energy is accom- 
modated more easily in the elastically soft directions, 
with composition modulations localized along this 
direction. 

Spinodal decompositions have now been studied in 
a number of systems such as Cu-Ni-Fe, Cu-Ni-Si, 
Ni-12Ti, Cu-5Ti exhibiting ‘side-bands’ in X-ray 
small-angle scattering, satellite spots in electron 
diffraction patterns and characteristic modulation of 
structure along (100) in electron micrographs. Many 
of the alloys produced by splat cooling might be 
expected to exhibit spinodal decomposition, and it has 


been suggested that in some alloy systems GP zones 
form in this way at high super saturations, because 
the GP zone solvus (see Figure 8.1) gives rise to a 
metastable coherent miscibility gap. 

The spinodally decomposed micro structure is 
believed to have unusually good mechanical stability 
under fatigue conditions. 


8.3 Strengthening of steels by 
heat-treatment 

8.3.1 Time-temperature-transformation 
diagrams 

Eutectoid decomposition occurs in both ferrous 
(e.g. iron- carbon) and non-ferrous (e.g. cop- 
per-aluminium, copper-tin) alloy systems, but it is 
of particular importance industrially in governing the 
hardening of steels. In the iron-carbon system (see 
Figure 3.18) the y-phase, austenite, which is a solid 
solution of carbon in fee iron, decomposes on cool- 
ing to give a structure known as pearlite, composed 
of alternate lamellae of cementite (Fe 3 C) and ferrite. 
However, when the cooling conditions are such that 
the alloy structure is far removed from equilibrium, an 
alternative transformation may occur. Thus, on very 
rapid cooling, a metastable phase called martensite, 
which is a supersaturated solid solution of carbon in 
ferrite, is produced. The microstructure of such a trans- 
formed steel is not homogeneous but consists of plate- 
like needles of martensite embedded in a matrix of 
the parent austenite. Apart from martensite, another 
structure known as bainite may also be formed if 
the formation of pearlite is avoided by cooling the 
austenite rapidly through the temperature range above 
5 5 CPC, and then holding the steel at some temperature 
between 250°C and 550°C. A bainitic structure con- 
sists of platelike grains of ferrite, somewhat like the 
plates of martensite, inside which carbide particles can 
be seen. 

The structure produced when austenite is allowed 
to transform isothermally at a given temperature can 
be conveniently represented by a diagram of the type 
shown in Figure 8.18, which plots the time necessary 
at a given temperature to transform austenite of eutec- 
toid composition to one of the three structures: pearlite, 
bainite or martensite. Such a diagram, made up from 
the results of a series of isothermal-decomposition 
experiments, is called a TTT curve, since it relates the 
transformation product to the time at a given tempera- 
ture. It will be evident from such a diagram that a wide 
variety of structures can be obtained from the austenite 
decomposition of a particular steel; the structure may 
range from 100% coarse pearlite, when the steel will 
be soft and ductile, to fully martensitic, when the steel 
will be hard and brittle. It is because this wide range 
of properties can be produced by the transformation of 
a steel that it remains a major constructional material 
for engineering purposes. 
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Figure 8.18 TTT curves for (a) eutectoid, (b) hypo-eutectoid and (c) low alloy (e.g. Ni/Cr/Mo) steels (after ASM Metals 
Handbook ). 


From the TTT curve it can be seen that just below 
the critical temperature, A\, the rate of transforma- 
tion is slow even though the atomic mobility must 
be high in this temperature range. This is because any 
phase change involving nucleation and growth (e.g. 
the pearlite transformation) is faced with nucleation 
difficulties, which arise from the necessary surface and 
strain energy contributions to the nucleus. Of course, as 
the transformation temperature approaches the temper- 
ature corresponding to the knee of the curve, the trans- 
formation rate increases. The slowness of the transfor- 
mation below the knee of the TTT curve, when bainite 
is formed, is also readily understood, since atomic 
migration is slow at these lower temperatures and 


the bainite transformation depends on diffusion. The 
lower part of the TTT curve below about 250-300°C 
indicates, however, that the transformation speeds up 
again and takes place exceedingly fast, even though 
atomic mobility in this temperature range must be very 
low. For this reason, it is concluded that the marten- 
site transformation does not depend on the speed of 
migration of carbon atoms and, consequently, it is 
often referred to as a diffusionless transformation. The 
austenite only starts transforming to martensite when 
the temperature falls below a critical temperature, usu- 
ally denoted by M s . Below M s the percentage of 
austenite transformed to martensite is indicated on the 
diagram by a series of horizontal lines. 
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The M s temperature may be predicted for steels 
containing various alloying elements in weight per cent 
by the formula, due to Steven and Haynes, given by 
M S (°C) = 561 -474C-33Mn- 17Ni- 17Cr-21Mo. 

8.3.2 Austenite -pearlite transformation 

8.3.2. 1 Nucleation and growth of pearlite 
If a homogeneous austenitic specimen of eutectoid 
composition were to be transferred quickly to a 
bath held at some temperature between 720° C and 
550°C, decomposition curves of the form shown in 
Figure 8.19a would be obtained. These curves, typical 
of a nucleation and growth process, indicate that 
the transformation undergoes an incubation period, an 
accelerating stage and a decelerating stage; the volume 
transformed into pearlite has the time-dependence 
described by the Avrami equation (7.44). When the 
transformation is in its initial stage the austenite 
contains a few small pearlite nodules each of which 
grow during the period A to B (see curve obtained 
at 690° C) and, at the same time, further nuclei form. 
The percentage of austenite transformed is quite small, 
since the nuclei are small and their total volume 
represents only a fraction of the original austenite. 
During the B to C stage the transformation rate 
accelerates, since as each nodule increases in size the 
area of contact between austenite and pearlite regions 
also increases: the larger the pearlite volumes, the 
greater is the surface area upon which to deposit further 
transformation products. At C, the growing nodules 
begin to impinge on each other, so that the area of 
contact between pearlite and austenite decreases and 
from this stage onwards, the larger the nodules the 
lower is the rate of transformation. Clearly, the rate of 
transformation depends on (1) the rate of nucleation of 
pearlite nodules, N (i.e. the number of nuclei formed in 
unit volume in unit time), and (2) the rate of growth 
of these nodules, G (i.e. the rate that the radius of 
the nodule increases with time). The variation of N 


and G with temperature for a eutectoid steel is shown 
in Figure 8.19b. 

The rate of nucleation increases with decreasing 
temperature down to the knee of the curve and in 
this respect is analogous to other processes of phase 
precipitation where hysteresis occurs (see Chapter 3). 
In addition, the nucleation rate is very structure sen- 
sitive so that nucleation occurs readily in regions of 
high energy where the structure is distorted. In homo- 
geneous austenite the nucleation of pearlite occurs 
almost exclusively at grain boundaries and, for this 
reason, the size of the austenite grains, prior to quench- 
ing, has an important effect on hardenability (a term 
which denotes the depth in a steel to which a fully 
martensitic structure can be obtained). Coarse-grained 
steels can be hardened more easily than fine-grained 
steels because to obtain maximum hardening in a steel, 
the decomposition of austenite to pearlite should be 
avoided, and this is more easily accomplished if the 
grain boundary area, or the number of potential pearlite 
nucleation sites, is small. Thus, an increase in austen- 
ite grain size effectively pushes the upper part of the 
TTT curve to longer times, so that, with a given cool- 
ing rate, the knee can be avoided more easily. The 
structure- sensitivity of the rate of nucleation is also 
reflected in other ways. For example, if the austenite 
grain is heterogeneous, pearlite nucleation is observed 
at inclusions as well as at grain boundaries. Moreover, 
plastic deformation during transformation increases the 
rate of transformation, since the introduction of dislo- 
cations provides extra sites for nucleation, while the 
vacancies produced by plastic deformation enhance the 
diffusion process. 

The rate of growth of pearlite, like the rate of nucle- 
ation, also increases with decreasing temperature down 
to the knee of the curve, even though it is governed 
by the diffusion of carbon, which, of course, decreases 
with decreasing temperature. The reason for this is that 
the interlamellar spacing of the pearlite also decreases 
rapidly with decreasing temperature, and because the 
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Figure 8.19 Effect of temperature on (a) amount of pearlite formed with time and (b) rate of nucleation and rate of growth of 
pearlite (after Mehl and Hagel, 1956; courtesy of Pergamon Press). 
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carbon atoms do not have to travel so far, the carbon 
supply is easily maintained. In contrast to the rate of 
nucleation, however, the rate of growth of pearlite is 
quite structure-insensitive and, therefore, is indiffer- 
ent to the presence of grain boundaries or inclusions. 
These two factors are important in governing the size 
of the pearlite nodules produced. If, for instance, the 
steel is transformed just below A\, where the rate of 
nucleation is very low in comparison with the rate 
of growth (i.e. the ratio N/G is small), very large 
nodules are developed. Then, owing to the structure- 
insensitivity of the growth process, the few nodules 
formed are able to grow across grain boundaries, with 
the result that pearlite nodules larger than the original 
austenite grain size are often observed. By compari- 
son, if the steel is transformed at a lower temperature, 
just above the knee of the TTT curve where N/G is 
large, the rate of nucleation is high and the pearlite 
nodule size is correspondingly small. 

8. 3. 2. 2 Mechanism and morphology of pearlite 
formation 

The growth of pearlite from austenite clearly involves 
two distinct processes: (1) a redistribution of carbon 
(since the carbon concentrates in the cementite and 
avoids the ferrite) and (2) a crystallographic change 
(since the structure of both ferrite and cementite differs 
from that of austenite). Of these two processes it is 
generally agreed that the rate of growth is governed by 
the diffusion of carbon atoms, and the crystallographic 
change occurs as readily as the redistribution of carbon 
will allow. The active nucleus of the pearlite nodule 
may be either a ferrite or cementite platelet, depending 
on the conditions of temperature and composition 
which prevail during the transformation, but usually it 
is assumed to be cementite. The nucleus may form at a 
grain boundary as shown in Figure 8.20a, and after its 
formation the surrounding matrix is depleted of carbon, 
so that conditions favour the nucleation of ferrite plates 
adjacent to the cementite nucleus (Figure 8.20b). The 
ferrite plates in turn reject carbon atoms into the 
surrounding austenite and this favours the formation 
of cementite nuclei, which then continue to grow. At 
the same time as the pearlite nodule grows sideways, 
the ferrite and cementite lamellae advance into the 
austenite, since the carbon atoms rejected ahead of the 
advancing ferrite diffuse into the path of the growing 
cementite (Figure 8.20c). Eventually, a cementite plate 
of different orientation forms and this acts as a new 
nucleus as shown in Figures 8.20d and 8.20e. 

Homogeneous austenite, when held at a constant 
temperature, produces pearlite at a constant rate 
and with a constant interlamellar spacing. However, 
the interlamellar spacing decreases with decreasing 
temperature, and becomes irresolvable in the optical 
microscope as the temperature approaches that 
corresponding to the knee of the curve. An increase 
in hardness occurs as the spacing decreases. Zener 
explains the dependence of interlamellar spacing 



Figure 8.20 Nucleation and growth of pearlite nodules. 

(a) Initial FesC nucleus; (b) FesC plate full grown, ot-Fe 
now nucleated; (c) a-Fe plates now full grown, new FesC 
plates nucleated; (d) FesC nucleus of different orientation 
forms and original nodule grows; (e) new nodule at 
advanced stage of growth (after Mehl and Hagel; courtesy 
of Pergamon Press). 


on temperature in the following way. If the 
interlamellar spacing is large, the diffusion distance 
of the carbon atoms in order to concentrate in 
the cementite is also large, and the rate of carbon 
redistribution is correspondingly slow. Conversely, if 
the spacing is small the area, and hence energy, 
of the ferrite-cementite interfaces become large. 
In consequence, such a high proportion of the 
free energy released in the austenite to pearlite 
transformation is needed to provide the interfacial 
energy that little will remain to provide the ‘driving 
force’ for the change. Thus, a balance between 
these two opposing conditions is necessary to allow 
the formation of pearlite to proceed, and at a 
constant temperature the interlamellar spacing remains 
constant. However, because the free energy change, 
AG, accompanying the transformation increases with 
increasing degree of undercooling, larger interfacial 
areas can be tolerated as the temperature of 
transformation is lowered, with the result that 
the interlamellar spacing decreases with decreasing 
temperature. 

The majority of commercial steels are not usu- 
ally of the euctectoid composition (0.8% carbon), but 
hypo-eutectoid (i.e. <0.8% carbon). In such steels, 
pro-eutectoid ferrite is first formed before the pearlite 
reaction begins and this is shown in the TTT curve 
by a third decomposition line. From Figure 8.18b it 
can be seen that the amount of pro-eutectoid ferrite 
decreases as the isothermal transformation temperature 
is lowered. The morphology of the precipitated fer- 
rite depends on the usual precipitation variables (i.e. 
temperature, time, carbon content and grain size) and 
growth occurs preferentially at grain boundaries and 
on certain crystallographic planes. The Widmanstatten 



278 Modern Physical Metallurgy and Materials Engineering 


pattern with ferrite growing along {111} planes of the 
parent austenite is a familiar structure of these steels. 

8. 3. 2. 3 Influence of alloying elements on 
pearlite formation 

With the exception of cobalt, all alloying elements in 
small amounts retard the transformation of austenite 
to pearlite. These elements decrease both the rate of 
nucleation, N, and the rate of growth, G, so that the 
top part of the TTT curve is displaced towards longer 
times. This has considerable technological importance 
since in the absence of such alloying elements, a steel 
can only transform into the harder constituents of bai- 
nite or martensite if it is in the form of very thin 
sections so that the cooling rate will be fast enough to 
avoid crossing the knee of the TTT curve during the 
cooling process and hence avoid pearlite transforma- 
tion. For this reason, most commercially heat-treatable 
steels contain one or more of the elements chromium, 
nickel, manganese, vanadium, molybdenum or tung- 
sten. Cobalt increases both N and G, and its effect 
on the pearlite interlamellar spacing is contrary to the 
other elements in that it decreases the spacing. 

With large additions of alloying elements, the simple 
form of TTT curve often becomes complex, as shown 
in Figure 8.18c. Thus to obtain any desired structure 
by heat treatment a detailed knowledge of the TTT 
curve is essential. 

8.3.3 Austenite-martensite transformation 

8.3.3. 1 Crystallography of martensite formation 
Martensite, the hardening constituent in quenched 
steels, is formed at temperatures below about 200°C. 
The regions of the austenite which have transformed 
to martensite are lenticular in shape and may easily be 
recognized by etching, or from the distortion they pro- 
duce on the polished surface of the alloy. These relief 
effects, shown schematically in Figure 8.21, indicate 
that the martensite needles have been formed not with 
the aid of atomic diffusion but by a shear process, since 
if atomic mobility were allowed the large strain energy 
associated with the transformed volume would then be 
largely avoided. The lenticular shape of a martensite 
needle is a direct consequence of the stresses produced 
in the surrounding matrix by the shear mechanism of 
the transformation and is exactly analogous to the sim- 
ilar effect found in mechanical twinning. The strain 
energy associated with martensite is tolerated because 
the growth of such sheared regions does not depend on 
diffusion, and since the regions are coherent with the 
matrix they are able to spread at great speed through 
the crystal. The large free energy change associated 
with the rapid formation of the new phase outweighs 
the strain energy, so that there is a net lowering of 
free energy. 

Direct TEM observations of martensite plates have 
shown that there are two main types of martensite, one 
with a twinned structure (see Figure 8.22), known as 




(b) 


Figure 8.21 Schematic diagram of the observed shape 
deformation produced by a martensite plate, (a) Contour 
lines on an originally flat surface; (b) section of the surface 
through AB. Vertical scale much exaggerated ( after Bilby 
and Christian, 1956; courtesy of the Institute of Metals). 

acicular martensite, and the other with a high density 
of dislocations but few or no twins, called massive 
martensite. 

In contrast to the pearlite transformation, which 
involves both a redistribution of carbon atoms and 
a structural change, the martensite transformation 
involves only a change in crystal structure. 

The structure cell of martensite is body-centred 
tetragonal, which is a distorted form of a body-centred 
cubic structure, and hence may be regarded as a super- 
saturated solution of carbon in a-iron. X-ray examina- 
tion shows that while the c / a ratio of the bet structure 
of martensite increases with increasing carbon content, 
the curve of cl a ratio against composition extrapolates 
back to c/a = 1 for zero carbon content, and the lattice 
parameter is equal to that of pure a-iron (Figure 8.23). 

From the crystallographic point of view the most 
important experimental data in any martensite transfor- 
mation are the orientation relations of the two phases 
and the habit plane. In steel, there are three groups of 
orientations often quoted; those due to Kurdjumov and 
Sachs, Nishiyama, and Greninger and Troiano, respec- 
tively. According to the Kurdjumov -Sachs relation, in 
iron-carbon alloys with 0.5- 1.4% carbon, a {1 1 \} y 
plane of the austenite lattice is parallel to the {11 0} a 
plane of the martensite, with a (1 1 0) K axis of the 
former parallel to a (1 1 l) a axis of the latter; the asso- 
ciated habit plane is {225} y . In any one crystal there 
are 24 possible variants of the Kurdjumov -Sachs rela- 
tionship, consisting of 12 twin pairs, both orientations 
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Outline of block of austenite which 
has transformed to martensite 



Figure 8.22 (a) Formation of a martensite platelet in a crystal of austenite; (b) the inhomogeneous twinning shear within the 
platelet ( after Kelly and Nutting, 1960; courtesy of the Royal Society). 


of a pair having the same habit plane. However, for 
general discussion it is usual to choose one relation 
which may be written 

(1 1 l)yll(l 0 l) a with [ll0] y ||[l lT]« 

In the composition range 1.5 -1.8% carbon the habit 
plane changes to ~ {25 9} y with the orientation rela- 
tionship unspecified. This latter type of habit plane has 
also been reported by Nishiyama for iron -nickel alloys 
(27-34% nickel) for which the orientation relationship 
is of the form 

(1 1 l)yll(l 0 l) a with [1 2 l] y ||[1 0l] a 

However, Greninger and Troiano have shown by pre- 
cision orientation determinations that irrational rela- 
tionships are very probable, and that in a ternary 
iron -nickel -carbon alloy (0.8% carbon, 22% nickel), 
(1 1 1 ) Y is approximately 1° from (1 0 l) a with [1 2 l] y 
approximately 2° from [1 0 l] a , and is associated with 
a habit plane about 5° from (25 9). 



Figure 8.23 Variation ofc and a parameters with carbon 
content in martensite (after Kurdjumov, 1948). 


8. 3. 3. 2 Mechanism of martensite formation 

The martensite transformation is diffusionless, and 
therefore martensite forms without any interchange in 
the position of neighbouring atoms. Accordingly, the 
observed orientation relationships are a direct conse- 
quence of the atom movements that occur during the 
transformation. The first suggestion of a possible trans- 
formation mechanism was made by Bain in 1934. He 
suggested that since austenite may be regarded as a 
body-centred tetragonal structure of axial ratio V2, the 
transformation merely involves a compression of the 
c-axis of the austenite unit cell and expansion of the 
of-axis. The interstitially dissolved carbon atoms pre- 
vent the axial ratio from going completely to unity, 
and, depending on composition, the c / a ratio will be 
between 1.08 and 1.0. Clearly, such a mechanism can 
only give rise to three martensite orientations whereas, 
in practice, 24 result. To account for this, Kurdju- 
mov and Sachs proposed that the transformation takes 
place not by one shear process but by a sequence 
of two shears (Figure 8.24), first along the elements 
(1 1 l) y (l 1 2) y , and then a minor shear along the ele- 
ments (11 2) a (l \\) a \ these elements are the twinning 
elements of the fee and bcc lattice, respectively. This 
mechanism predicts the correct orientation relations, 
but not the correct habit characteristics or relief effects. 
Accordingly, Greninger and Troiano in 1941 proposed 
a different two- stage transformation, consisting of an 
initial shear on the irrational habit plane which pro- 
duces the relief effects, together with a second shear 
along the twinning elements of the martensite lattice. 
If slight adjustments in spacing are then allowed, the 
mechanism can account for the relief effects, habit 
plane, the orientation relationship and the change of 
structure. 

Further additions to these theories have been made 
in an effort to produce the ideal general theory 
of the crystallography of martensite transformation. 
Bowles, for example, replaces the first shear of the 
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Figure 8.24 Shear mechanisms of Kurdjumov and Sachs. 

(a) Face-centred austenite with {1 1 1 } y in horizontal plane, 

(b) body-centred tetragonal martensite (a'), (c) cubic ferrite 
(a) (after Bowles and Barrett, 1952; courtesy of Pergamon 
Press). 

Greninger-Troiano mechanism by the general type of 
homogeneous deformation in which the habit plane 
remains invariant, i.e. all directions in this plane are 
unrotated and unchanged in length. However, in all 
such cases the problem resolves itself into one of 
determining whether a homogeneous strain can trans- 
form the y-lattice into the o'-lattice, while preserv- 
ing coherency at the boundary between them. The 



(a) I 1 


homogeneous strain does not do this, so that some 
reasonable additional type of strain has to be added. 

This shear can occur either by twinning or by slip, 
the mode prevailing depending on the composition and 
cooling rate. Between carbon contents of 0.2% and 
0.5% the martensite changes from dislocated marten- 
site arranged in thin lathes or needles to twinned aci- 
cular martensite arranged in plates. In the martensite 
formed at low C contents (e.g. Fe-Ni alloys) the thin 
lathes lie parallel to each other, with a {1 1 \} Y habit, 
to form pockets of massive martensite with jagged 
boundaries due to the impingement of other nearby 
pockets of lathes. The inhomogeneous shear produced 
by deformation twinning occurs on {1 12} planes in the 
martensite, so that each martensite plate is made up of 
parallel twin plates of thickness 2-50 nm. By oper- 
ation of such a complex transformation mode with a 
high index habit plan the system maintains an invariant 
interfacial plane. 

Because of the shears involved and the speed of 
the transformation it is attractive to consider that 
dislocations play an important role in martensite for- 
mation. Some insight into the basic dislocation mech- 
anisms has been obtained by in situ observations 
during either cooling below M s or by straining, but 
unfortunately only for Ni-Cr austenitic steels with 
low stacking-fault energy (i.e. y ~ 20 mJ/m 2 ). For 
these alloys it has been found that stacking faults 
are formed either by emitting partial dislocations with 
b = a/ 6(1 1 2) from grain boundaries or by the dis- 
sociation of unit dislocations with b = a/ 2(110). In 
regions of the grain where on cooling or deformation 
a high density of stacking faults developed, the corre- 
sponding diffraction pattern revealed cph e-martensite. 
On subsequent deformation or cooling, regions of s- 
martensite transform rapidly into bcc o'-martensite, 
and indeed, the only way in which o'-martensite was 
observed to form was from an e nucleus. 



Figure 8.25 Electron micrographs showing (a) contrast from overlapping faults on (1 1 1 ); A is extrinsic and B is intrinsic in 
nature, (b) Residual contrast arising from a supplementary displacement across the faults which is intrinsic in nature for both 
faults A and B (after Brooks, Loretto and Smallman, 1979). 
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Because straining or cooling can be interrupted 
during the in situ experiments it was possible to 
carry out a detailed analysis of the defect structure 
formed prior to a region becoming recognizably 
(from diffraction patterns) martensitic. In this way 
it has been shown that the interplanar spacing 
across the individual stacking faults in the austenite 
decreased to the (0001) spacing appropriate to 
^-martensite. Figure 8.25 shows micrographs which 
reveal this change of spacing; no contrast is expected 
in Figure 8.25b if the faulted {111} planes remained 
at the fee spacing, since the condition of invisibility 
g.R = n is obeyed. The residual contrast observed 
arises from the supplementary displacement A R across 
the faults which, from the white outer fringe, is 
positive (intrinsic) in nature for both faults and ~ 
2% of the {111} spacing. The formation of regions 
of a from s could also be followed although in 
this case the speed of the transformation precluded 
detailed analysis. Figure 8.26 shows a micrograph 
taken after the formation of o'-martensite and this, 
together with continuous observations, show that the 
martensite/matrix interface changes from {1 1 1} to the 
well-known {225} as it propagates. Clearly, one of 
the important roles that the formation of e-martensite 
plays in acting as a precursor for the formation 
of o'-martensite is in the generation of close-packed 
planes with ABAB stacking so that atomic shuffles 
can subsequently transform these planes to {1 1 0} bcc 
which are, of course, stacked ABAB (see Figure 8.27). 
The o'-martensite forms in dislocation pile-ups where 
the a/ 6(1 1 2) partials are forced closer together by the 
applied stress. The volume of effective bcc material 
increases as more dislocations join the pile-up until the 
nucleus formed by this process reaches a critical size 
and rapid growth takes place. The martensite initially 
grows perpendicular to, and principally on, one side of 
the {1 1 \} Y slip plane associated with the nucleus, very 
likely corresponding to the side of the dislocations with 
missing half-planes since o'-martensite is less dense 
than austenite. 



Figure 8.26 Electron micrograph showing an a-martensite 
plate, the austenite -martensite interface, and the faults in 
the austenite matrix. 



Figure 8.27 A shear of aj 6(1 1 2) moves atoms in the fee 
structure from A sites to B sites, and after half this shear the 
structure has pseudo -bcc packing. 

8. 3. 3. 3 Kinetics of martensite formation 
One of the most distinctive features of the martensite 
transformation is that in most systems martensite is 
formed only when the specimen is cooling, and that 
the rate of martensite formation is negligible if cool- 
ing is stopped. For this reason, the reaction is often 
referred to as an athermal 1 transformation, and the 
percentage of austenite transformed to martensite is 
indicated on the TTT curve by a series of horizontal 
lines. The transformation begins at a temperature M s , 
which is not dependent on cooling rate, but is depen- 
dent on prior thermal and mechanical history, and on 
composition. For example, it is well established that 
the M s temperature decreases approximately linearly 
with increasing concentration of solutes such as car- 
bon, nickel or manganese. 

Speed of formation The observation that martensite 
plates form rapidly and at a rate which is 
temperature-independent shows that thermal activation 
is not required for the growth process. Electronic 
methods show that the martensite needles form, in 
iron -nickel -carbon alloys, for example, in about 
10 -7 s and, moreover, that the linear growth velocity 
is about 10 3 m s _1 even at very low temperatures. 
Such observations show that the activation energy for 
the growth of a martensite plate is virtually zero, and 
that the velocity of growth approaches the speed of 
sound in the matrix. Sometimes a ‘burst phenomenon’ 
is exhibited, as, for example, in iron-nickel alloys, 
when the stresses produced by one martensite plate 
assist in the nucleation of others. The whole process is 
autocatalytic and about 25% of the transformation can 
occur in the time interval of an audible click. 

The effect of applied stress Since the formation of 
martensite involves a homogeneous distortion of the 
parent structure, it is expected that externally applied 

! In some alloys, such as iron-manganese-carbon and 
iron -manganese -nickel, the martensitic transformation 
occurs isothermally. For these systems, growth is still very 
rapid but the nuclei are formed by thermal activation. 
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stresses will be of importance. Plastic deformation is 
effective in forming martensite above the M s tempera- 
ture, provided the temperature does not exceed a criti- 
cal value usually denoted by M d . However, cold work 
above M d may either accelerate or retard the transfor- 
mation on subsequent cooling. Even elastic stresses, 
when applied above the M s temperature and main- 
tained during cooling, can affect the transformation; 
uniaxial compression or tensile stresses raise the M s 
temperature while hydrostatic stresses lower the M s 
temperature. 

Stabilization When cooling is interrupted below M s , 
stabilization of the remaining austenite often occurs. 
Thus, when cooling is resumed martensite forms only 
after an appreciable drop in temperature. Such ther- 
mal stabilization has been attributed by some workers 
to an accumulation of carbon atoms on those dislo- 
cations important to martensite formation. This may 
be regarded as a direct analogue of the yield phe- 
nomenon. The temperature interval before transfor- 
mation is resumed increases with holding time and is 
analogous to the increase in yield drop accompanying 
carbon build-up on strain- ageing. Furthermore, when 
transformation in a stabilized steel does resume, it 
often starts with a ‘burst’, which in this case is analo- 
gous to the lower yield elongation. 

8.3.4 Austenite -bainite transformation 

The bainite reaction has many features common to 
both the pearlite and martensite reactions. The pearlite 
transformation involves the redistribution of carbon 
followed by a structure change, the martensite trans- 
formation involves the structure change alone, and, in 
contrast, the bainite transformation involves a struc- 
ture change followed by the redistribution of car- 
bon, which precipitates as a carbide. Consequently, 
the austenite -bainite decomposition may be regarded 
as a martensite transformation involving the diffusion 
of carbon atoms, so that, in this case, the rate of 
coherent growth is necessarily slow compared with 
that of martensite. Lower bainite is hardly distinguish- 
able from martensite tempered at the same tempera- 
ture, while upper bainite exhibits an acicular structure. 
The metallographic appearance of the transformed 
steel is found to alter continuously between these two 
extremes, the actual structure exhibited being governed 
by the diffusion rate of the carbon, which in turn 
depends on the temperature of the transformation. The 
hardness of the reaction product also increases con- 
tinuously with decreasing temperature, lower bainite 
being harder than upper bainite, which is harder than 
most fine pearlite. 

The ferrite in bainite has a martensite-like appear- 
ance and is, in most cases, clearly distinguished from 
both ferrite and pro-eutectoid ferrite formed in the 
pearlite range. The bainitic ferrite exhibits the same 
surface relief effects as martensite, while pro-eutectoid 
ferrite and pearlite do not. Such surface tilting is fur- 
ther evidence for a shear-like transformation, but the 


orientation relationship between austenite and bainite 
is not necessarily the same as that between austenite 
and martensite. In fact, the bainitic ferrite has the same 
orientation with respect to the parent austenite as does 
pro-eutectoid ferrite, which suggests that ferrite may 
nucleate bainite. 

8.3.5 Tempering of martensite 

The presence of martensite in a quenched steel, while 
greatly increasing its hardness and TS, causes the 
material to be brittle. Such behaviour is hardly surpris- 
ing, since the formation of martensite is accompanied 
by severe matrix distortions. The hardness and strength 
of martensite increase sharply with increase in C con- 
tent. Contributions to the strength arise from the carbon 
in solution, carbides precipitated during the quench, 
dislocations introduced during the transformation and 
the grain size. 

Although the martensite structure is thermodynam- 
ically unstable, the steel will remain in this condition 
more or less indefinitely at room temperature because 
for a change to take place bulk diffusion of car- 
bon, with an activation energy Q of approximately 
83 kJ/mol atom is necessary. However, because there 
is an exponential variation of the reaction rate with 
temperature, the steel will be able slowly to approach 
the equilibrium structure at a slightly elevated temper- 
ature, i.e. rate of reaction = A exp [—Q/kT]. Thus, by 
a carefully controlled tempering treatment, the quench- 
ing stresses can be relieved and some of the carbon can 
precipitate from the supersaturated solid solution to 
form a finely dispersed carbide phase. In this way, the 
toughness of the steel can be vastly improved with very 
little detriment to its hardness and tensile properties. 

The structural changes which occur on tempering 
may be considered to take place in three stages. In 
the primary stage, fine particles of a cph carbide 
phase (e-carbide) of composition about Fe 2 , 4 C, pre- 
cipitates, with the corresponding formation of low- 
carbon martensite. This low-carbon martensite grows 
at the expense of the high-carbon martensite until at 
the end of this stage the structure consists of retained 
austenite, e-carbide and martensite of reduced tetrago- 
nality. During the second stage any retained austenite 
in the steel begins to transform isothermally to bainite, 
while the third stage is marked by the formation of 
cementite platelets. The precipitation of cementite is 
accompanied by a dissolution of the e-carbide phase 
so that the martensite loses its remaining tetragonality 
and becomes bcc ferrite. The degree to which these 
three stages overlap will depend on the temperature 
of the anneal and the carbon content. In consequence, 
the final structure produced will be governed by the 
initial choice of steel and the properties, and hence 
thermal treatment, required. Alloying elements, with 
the exception of Cr, affect the tempering of marten- 
site. Plain carbon steels soften above 100°C owing to 
the early formation of £-carbide, whereas in Si-bearing 
steels the softening is delayed to above 250°C, since 
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Si stabilizes ^-carbide and delays its transformation 
to cementite. Alloying additions (see Table 8.2) thus 
enable the improvement in ductility to be achieved at 
higher tempering temperatures. 

When a steel specimen is quenched prior to temper- 
ing, quenching cracks often occur. These are caused 
by the stresses which arise from both the transfor- 
mation and the differential expansion produced when 
different parts of the specimen cool at different rates. 
To minimize such cracking, the desired properties of 
toughness and strength are often produced in the steel 
by alternative heat- treatment schedules; examples of 
these schedules are summarized in Figure 8.28, from 
which it will become evident that advantage is taken of 
the shape of the TTT curve to economize on the time 
the specimen is in the furnace, and also to minimize 
quenching stresses. During conventional annealing, for 
example, the steel is heated above the upper critical 
temperature and allowed to cool slowly in the fur- 
nace. In isothermal annealing the steel is allowed to 
transform in the furnace, but when it has completely 
transformed, the specimen is removed from the fur- 
nace and allowed to air-cool, thereby saving furnace 
time. In martempering, the knee of the TTT curve is 
avoided by rapid cooling, but the quench is interrupted 
above M s and the steel allowed to cool relatively 
slowly through the martensite range. With this treat- 
ment the thermal stresses set up by very rapid cooling 
are reduced. Such a procedure is possible because at 
the holding temperature there is ample time for the 
temperature to become equal throughout the sample 
before the transformation begins, and as a result the 
transformation occurs much more uniformly. After the 
transformation is complete, tempering is carried out 
in the usual way. In austempering, quenching is again 
arrested above M s and a bainite product, having sim- 
ilar properties to tempered martensite, is allowed to 
form. 

Alloying elements also lower the M s temperatures 
and, consequently, greater stresses and distortion are 
introduced during quenching. This can be minimized 
by austempering and martempering as discussed 
above, but such treatments are expensive. Alloying 


Table 8.2 Influence of alloying additions on tempering 


Element 

Retardation in 
tempering per 1% 
addition 

Ratio of retardation 
of tempering to 
depression of M s 

C 

-40 

negative 

Co 

8 

>8 

Cr 

0 

0 

Mn 

8 

0.24 

Mo 

17 

0.8 

Ni 

8 

0.5 

Si 

20 

1.8 

V 

30 

>1.0 

w 

10 

0.9 


elements should therefore be chosen to produce the 
maximum retardation of tempering for minimum 
depression of M s ; Table 8.2 shows that (1) C should 
be as low as possible, (2) Si and Co are particularly 
effective, and (3) Mo is the preferred element of the 
Mo, W, V group since it is easier to take into solution 
than V and is cheaper than W. 

Some elements, particularly Mo and V, produce 
quite high tempering temperatures. In quantities above 
about 1% for Mo and \% for V, a precipitation reac- 
tion is also introduced which has its maximum hard- 
ening effect at 550°C. This phenomenon of increased 
hardness by precipitation at higher temperatures is 
known as secondary hardening and may be classified 
as a fourth stage of tempering. 2-2^ Mo addition pro- 
duces adequate temper resistance and changes the pre- 
cipitate to Mo 2 C which is more resistant to overageing 
than Cr 7 C 3 which is present in most alloy steels. High 
V additions lead to undissolved V 4 C 3 at the quench- 
ing temperature, but 0.5 V in conjunction with 2Mo 
does not form a separate carbide during tempering but 
dissolves in the Mo 2 C. Cr also dissolves in Mo 2 C 
but lowers the lattice parameters of the carbide and 
hence lowers the temper resistance by decreasing the 
matrix/carbide mismatch. However, lCr may be toler- 
ated without serious reduction in temper resistance and 
reduces the tendency to quench crack. Si decreases the 
lattice parameter of matrix ferrite and hence increases 
temper resistance. A typical secondary hardening steel 
usually contains 0.4C, 2Mo, 0.5V, 0.5Si and 1.5Cr, 
with 1.8 GN/m 2 TS and 15% elongation. 

8.3.6 Thermo-mechanical treatments 

To produce steels with an improved strength/ductility 
ratio the heat-treatment may be modified to include 
a deformation operation in the process. The combined 
use of mechanical working and heat-treatment is gener- 
ally called thermo-mechanical treatment (THT). Three 
types of treatment have proved successful with marten- 
sitic and bainitic steels. These may be classified as 
follows: 

1. Deformation in the stable austenite range above A 3 
before transformation, i.e. (HTHT). 

2. Deformation below Ai before transformation; this 
(LTHT) low-temperature thermo-mechanical treat- 
ment is called ausforming. 

3. Deformation during isothermal transformation to 
pearlite, i.e. below A 3 , known as isoforming. 

The main advantage of HTHT is in grain refine- 
ment, and steels such as silicon- steels that recrystallize 
slowly are particularly suitable. It can, however, be 
applied to low-alloy high-carbon tool steels which are 
not suitable for ausforming, with significant increases 
in strength and toughness. The fatigue limit is also 
improved in many steels provided the deformation is 
limited to 25-30%. In ausforming, the deformation is 
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Figure 8.28 Diagrams showing the heat-treatment procedure during (a) isothermal annealing, (b) martempering and 
(c) austempering. 


usually carried out in the range 450-550°C and hence 
the steel must have a large bay in the TTT diagram 
to enable the deformation to be carried out. A suitable 
steel is Fe-0.35C-0.5Mn-l.5Ni-l.25Cr-0.25Mo for 
which the strength increases by about 4. 6-7. 7 MN/m 2 
for each per cent of deformation. The properties are 
improved as the deformation temperature is lowered, 
provided it is not below M s , and with high deformation 
treatments (>70%) strengths up to about E/70 with 
good ductility have been achieved. A very fine carbide 
dispersion is produced in the austenite during deforma- 
tion together with a high density of dislocations. The 
removal of carbon from solution in the austenite means 
that during transformation the martensite formed is less 
supersaturated in C and thus has lower tetragonality 
and is more ductile. The carbides also pin the dis- 
locations in the austenite, helping to retain some of 
them together with those formed during the transfor- 
mation. The martensite formed is therefore heavily dis- 
located with relatively stable dislocations (compared to 
those which would be formed by deforming marten- 
site at room temperature), and has superior strength 
and toughness. Such steels are, of course, somewhat 
difficult to machine. 

Isoforming has potential in improving the toughness 
of low-alloy steels. During isoforming to pearlite the 
normal ferrite/pearlite structure is modified, by the 
polygonization of sub-grains in the ferrite and the 
spheroidizing of cementite particles. Isoforming to 
bainite is also possible. 

8.4 Fracture and toughness 

8.4.1 Griffith micro-crack criterion 

Most materials break at a stress well below the theo- 
retical fracture stress, which is that stress, <r t , required 
to pull apart two adjoining layers of atoms. This stress 
varies with the distance between the atom planes and, 
as shown in Figure 8.29, may be approximated to a 
sine curve of wavelength such that 




Figure 8.29 Model for estimating the theoretical fracture 
strength cr t . 

where u is the displacement from the equilibrium 
spacing b. From Hooke’s law a = ( Eu/b ) and hence 
<j t = XE/2nb. Now in pulling apart the two atomic 
planes it is necessary to supply the surface energy y 
and hence 



so that the theoretical tensile strength is given by 

Ot = VWJb) (8.22) 

Glass fibres and both metallic and non-metallic 
whiskers have strengths approaching this ideal value 
of about E/ 10, but bulk metals even when tested under 
favourable conditions (e.g. at 4 K) can rarely withstand 
stresses above E/ 100. Griffith, in 1920, was the first to 
suggest that this discrepancy was due to the presence 
of small cracks which propagate through the crystal 
and cause fracture. Griffith’s theory deals with elastic 
cracks, where at the tip of the crack atomic bonds 
exist in every stage of elongation and fracture. As the 
crack grows, each of the bonds in its path take up the 
strain, and the work done in stretching and breaking 
these bonds becomes the surface energy y of the 
fractured faces. If separation of the specimen between 
two atomic layers occurs in this way, the theoretical 
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strength only needs to be exceeded at one point at a 
time, and the applied stress will be much lower than the 
theoretical fracture stress. The work done in breaking 
the bonds has to be supplied by the applied force, or by 
the elastic energy of the system. Assuming for a crack 
of length 2c that an approximately circular region of 
radius c is relieved of stress a and hence strain energy 
(a 2 /2E)nc 2 by the presence of the crack, the condition 
when the elastic strain of energy balances the increase 
of surface energy is given by 


d / 7 rc 2 a 2 \ 
d c V 2E ) 


d 

dc 


(ley) 


and leads to the well-known Griffith formula 



(8.23) 


for the smallest tensile stress o able to propagate a 
crack of length 2c. The Griffith criterion therefore 
depends on the assumption that the crack will spread 
if the decrease in elastic strain energy resulting from 
an increase in 2c is greater than the increase in surface 
energy due to the increase in the surface area of 
the crack. 

Griffith’s theory has been verified by experiments 
on glasses and polymers at low temperatures, where a 
simple process of fracture by the propagation of elastic 
cracks occurs. In such ‘weak’ brittle fractures there 
is little or no plastic deformation and y is mainly 
the surface energy (~1-10 J m -2 ) and the fracture 
strength Of ~ 10 _5 £’. In crystalline solids, however, 
the cracks are not of the elastic type and a plastic zone 
exists around the crack tip as shown in Figure 8.30. 
In such specimens, fracture cannot occur unless the 
applied tensile stress o is magnified to the theoretical 
strength <r t . For an atomically sharp crack (where the 
radius of the root of the crack r is of the order of b) of 
length 2c it can be shown that the magnified stress <r m 
will be given by <r m = G^/{cJr) which, if the crack is 
to propagate, must be equal to the theoretical fracture 
stress of the material at the end of the crack. It follows 
that substituting this value of <r t in equation (8.22) 
leads to the Griffith formula of equation (8.23). 

Figure 8.30 shows the way the magnified stress 
drops off with distance from the tip of the crack. 
Clearly, at some distance r y the stress reaches the 
yield stress and plastic flow occurs. There is thus a 
zone of plastic flow around the tip of radius r y . The 
larger the plastic zone, as in ductile metals, the more 
energy is absorbed in fracture. In ceramics this zone 
is usually small. 

In ‘strong’ fractures y is greatly increased by the 
contribution of the plastic work around the crack 
tip which increases the work required for crack 
propagation. The term y must now be replaced by (y + 
y p ) where y p is the plastic work term; generally (y + 
y p ) is replaced by G, the strain energy release rate, 



Figure 8.30 Variation of stress from the tip of a crack and 
the extent of the plastic zone, radius r y . 


so that equation (8.23) becomes the Orowan-Irwin 
relationship 

a = yj {EG I it c) (8.24) 

Here, G might be ~10 4 J m“ 2 and a f ~ 10~ 2 -10~ 3 E. 

8.4.2 Fracture toughness 

In engineering structures, particularly heat-treated 
steels, cracks are likely to arise from weld defects, 
inclusions, surface damage, etc. and it is necessary to 
design structures with the knowledge that cracks are 
already present and capable of propagation at stresses 
below the macroscopic yield stress as measured in a 
tensile test. Since different materials show different 
crack propagation characteristics (e.g. hard steel and 
glass) it is necessary for the design engineers to find 
the limiting design stress in terms of some property 
or parameter of the material. For this reason, a 
fracture toughness parameter is now being employed 
to measure the tendency of cracks of given dimensions 
to propagate under particular stress conditions. 

In Section 8.4.1 it was shown that o^fnc) = 
V(FG), which indicates that fast fracture will occur 
when a material is stressed to a value a and the crack 
reaches some critical size, or alternatively when a 
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material containing a crack is subjected to some critical 
stress <r, i.e. the critical combination of stress and crack 
length for fast fracture is a constant, *J(EG) for the 
material, where E is Young’s modulus and G is the 
strain energy release rate. The term o+J (ttc) is given 
the symbol K and is called the stress intensity factor 
with units MN m _3/2 . Fast fracture will then occur 
when K = K c , where K c [= V( EG C )] is the critical 
stress intensity factor, or more commonly the fracture 
toughness parameter. 

The fracture toughness of a material can alter 
markedly depending on whether the elastic-plastic 
field ahead of the crack approximates to plane strain 
or plane stress conditions, much larger values being 
obtained under plane stress conditions as operate in 
thin sheets. The important and critical factor is the 
size of the plastic zone in relation to the thickness of 
the section containing the crack. When this is small, 
as in thick plates and forgings, plane strain conditions 
prevail and the hydrostatic tension ahead of the crack 
results in a semi-brittle ‘flat’ fracture. When the value 
is large as in thin sheets of ductile metals plane stress 
conditions operate and the tension at the crack front is 
smaller, giving rise to a more ductile mode of failure. 
At intermediate values a mixed fracture, with a flat 
centre bordered by shear lips, is obtained. Thus with- 
out changing the structure or properties of the materials 
in any way it is possible to produce a large differ- 
ence in fracture toughness by changing the section 
thickness. For thick sections, when a state of complete 
constraint is more nearly approached, the values of K c 
and G c approach minimum limiting values. These val- 
ues are denoted by K\ c and G Ic and are considered to 
be material constants; the subscript I denotes the first 
mode of crack extension, i.e. the opening mode (see 
Figure 8.31). 

The general procedure in measuring the fracture 
toughness parameter is to introduce a crack of suit- 
able size into a specimen of suitable dimension and 
geometry. The specimen is then loaded at a slow rate 
and the crack extension measured up to the critical 
condition. The measurement of Ki c will be valid if the 
plastic zone size is small (by a factor 10) in relation 
to the cross-section of the specimen. The zone size 
r y may be obtained by equating the stress field of the 
crack at r = r y to the strength <r y of the material and 
is given by 

r y = 

An Ashby property chart of fracture toughness versus 
strength, given in Figure 8.32, shows that the size of 
the zone (broken lines) varies from atomic dimensions 
for brittle ceramics to tens of centimetres for ductile 
metals. 

In designing safe structures for a given load, the 
structure is required to yield before it breaks. For a 
minimum detectable crack size of 2c this condition is 
given by (Ki c /<r y ) > *Jnc. The safest material is the 
one with the greatest value of Ki c /cr y . Clearly, a high 


yield stress a y is also required. The chart shows that 
steel satisfies both these requirements and indicates 
why it is still the best material for highly stressed 
structures where weight is not important. 

Fracture toughness requirements are now written 
into the general specification of high-technology alloys 
and hence it is necessary to determine the effect 
of heat-treatment and alloying additions on fracture 
toughness parameters. Processes such as ausforming 
and controlled rolling improve the fracture toughness 
of certain steels. Carbon has a considerable effect and 
there are advantages in reducing the C-level below 
0.1% where possible. High-strength low-alloy (HSLA) 
steels have C^0.1% and the Nb, V and Ti addi- 
tions form fine carbides which together with the small 
grain sizes enable good strength levels and acceptable 
fracture toughness values to be achieved. Maraging 
steels with high alloy and low carbon (<0.04%) give 
very high strength combined with high toughness (see 
Chapter 9). 

The brittleness of ceramics is directly linked to their 
high notch-sensitivity. The fracture toughness of most 
ceramics is low: expressed in quantitative terms, it is 
commonly less than 8 MN m _3/2 . Flaws, often very 
minute, are almost invariably present in ceramics and 
act as stress-concentrating notches. It is extremely dif- 
ficult to prevent these flaws from forming during man- 
ufacture and service. In terms of engineering practice, 
brittle ceramic components are intolerant of misalign- 
ment and poor fits within assemblies. The presence of 
flaws is also responsible for the variability or ‘spread’ 
of results from mechanical tests and introduces uncer- 
tainty into the design process. (The ‘spread’ is much 
less for metallic materials.) Design procedures have 
moved well beyond the principle that ceramics are only 
safe when compressive stresses are dominant. Prob- 
abilistic assessments of mechanical test results from 
ceramics now tackle the difficult task of allowing for 
randomness in the size, shape and distribution of flaws 
(see Section 10.6). 

Despite these underlying problems, progress has 
been made in producing tough ‘ductile ceramics’. 



Figure 8.31 Variation in the fracture toughness parameter 
with section thickness. 
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Strength o- y /MN m-2 

Fraclure-loughness, K lc , plotted against strength. a y . The contours show the value of 
K 2 Wl n a 2 — roughly, the diameter of the process-zone at a crack tip (units: mm). The guide lines of constant 
y A \ja y and are used in yield-before-break and leak-before-break design. 

Figure 8.32 Ashby property chart of fracture toughness versus strength (Ashby, 1989, pp. 1273-93; with permission of 
Elsevier Science Ltd). 


Fracture toughness values above 20 MN m _3/2 have 
been achieved. Typical measures include elimination 
of flaws (microcracks, pores), incorporation of crack- 
retarding phases (in zirconia) and reduction of average 
grain size (below 1 mm). 

At ambient temperature, fracture surfaces of ceram- 
ics are conchoidal (glassy), intergranular or cleavage 
in character, depending on the material. The strain 
at fracture is very small, being about 0.001. At ele- 
vated temperatures, under creep conditions, fracture 
strain is greater. Conventional ceramics have little 
or no capacity for slip and crack tips therefore tend 
to remain sharp. As the crack propagates, the load- 
supporting cross-section gets smaller and the general 
level of stress increases so that failure can be sudden. 


These remarks refer to tensile stresses since compres- 
sive stresses will, of course, tend to close the crack. 

In some poly crystalline ceramics, such as magnesia, 
the von Mises criterion for maintenance of cohesions 
is not satisfied. Slip is limited and cracks are not effec- 
tively blunted. However, raising the temperature often 
enables the necessary minimum of five independent 
deformation modes to operate, leading to ductility. Any 
treatments which eliminate surface flaws will natu- 
rally enhance this ductility. (Against this background, 
one can readily appreciate why fabrication methods 
for poly crystalline ceramic components, in contrast to 
those for metals, are not based upon bulk deforma- 
tion.) At these higher temperatures the environment 
becomes increasingly more likely to react with surfaces 
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of the ceramic: it may even penetrate an open texture 
and cause crack-blunting. The mechanisms of flow 
in poly crystalline ceramics at elevated temperatures 
are similar to those encountered in metallic systems 
(e.g. glide and climb of dislocations, vacancy diffu- 
sion, grain boundary sliding). The similarity between 
deformation processes in poly crystalline ceramics and 
metals is evident if one compares the layout of the 
corresponding deformation-mechanism maps. A minor 
phase is usually present at grain boundary surfaces in 
ceramics, functioning as a ceramic bond, and if it soft- 
ens with rise in temperature, then the grains may be 
able to slide past each other. These regions tend to 
liquefy before the actual grains so that it is advisable 
not to exceed the solidus temperature. Grain boundary 
surfaces are particularly susceptible to the nucleation 
of cracks. As in the final accelerating stage of metal- 
lic creep, these cracks form in planes normal to the 
direction of applied stress. 

8.4.3 Cleavage and the ductile -brittle 
transition 

The fracture toughness versus strength chart, shown 
in Figure 8.32, indicates the wide spread of values 
for the different classes of material. Metals dissi- 
pate much energy in the plastic zone, which accounts 
for the difference between the fracture energy G Ic 
and the true surface energy y. The larger the zone, 
the more energy is absorbed; G\ c is high and so is 
/Gc(~100 MN m -3/2 ). Ceramics and glasses fracture 
without much plastic flow to blunt the cracks by sim- 
ply breaking atomic bonds, leading to cleavage; for 
these materials K\ c is less than 10 MN itT 3/2 . 

At low temperatures some metals, notably steels, 
also become brittle and fracture by cleavage. Since 
they are ductile at room temperature this transition 


to brittle cleavage behaviour is quite spectacular and 
has led to several engineering catastrophes. In general, 
brittle cleavage can occur in metals with bcc and cph 
under the appropriate conditions while in fee materials 
it does not. The most important factor linking these 
three different structures is the Peierls stress and the 
way the yield stress varies with temperature. In steel, 
for example, the yield stress increases rapidly with 
lowering of temperature below room temperature such 
that plastic deformation at the crack tip is minimized 
and the fracture mechanism changes from plastic tear- 
ing to cleavage. Even in these materials some plastic 
deformation does occur. 

Several models have been suggested for the pro- 
cess whereby glide dislocations are converted into 
micro-cracks. The simplest mechanism, postulated by 
Stroh, is that involving a pile-up of dislocations against 
a barrier, such as a grain boundary. The applied 
stress pushes the dislocations together and a crack 
forms beneath their coalesced half-planes, as shown 
in Figure 8.33a. A second mechanism of crack for- 
mation, suggested by Cottrell, is that arising from the 
junction of two intersecting slip planes. A dislocation 
gliding in the (101) plane coalesces with one gliding 
in the (101) plane to form a new dislocation which 
lies in the (001) plane according to the reaction, 

at 2[TT 1] + a/2[l 1 1] -* o[ 001] 

The new dislocation, which has a Burgers vector 
< 2 [ 001 ], is a pure edge dislocation and, as shown 
in Figure 8.33b, may be considered as a wedge, one 
lattice constant thick, inserted between the faces of the 
(0 01) planes. It is considered that the crack can then 
grow by means of other dislocations in the (101) and 
(1 01) planes running into it. Although the mechanism 
readily accounts for the observed (100) cleavage 




i 
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Figure 8.33 Formation of a crack (a) by the piling up of dislocations against a grain boundary after strain and (b) by the 
dislocations on (1 0 1) and (101 ) planes running together ( after Cottrell, 1958, p. 192). 
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plane of bcc metals, examples have not been directly 
observed. 

While these dislocation coalescence mechanisms 
may operate in single-phase materials, in two-phase 
alloys it is usually easier to nucleate cracks by piling up 
dislocations at particles (e.g. grain boundary cementite 
or cementite lamellae in pearlite). The pile-up stress 
then leads to cracking of either the particle or the 
particle/matrix interface. A brittle -ductile transition 
can then be explained on the basis of the criterion 
that the material is ductile at any temperature, if the 
yield stress at that temperature is smaller than the 
stress necessary for the growth of these micro-cracks, 
but if it is larger the material is brittle. If cleavage 
cracks are formed by such a dislocation mechanism, 
the Griffith formula may be rewritten to take account 
of the number of dislocations, n , forming the crack. 
Thus, rearranging Griffith’s formula we have 

g(2cg/E) = y = 2 cg 2 /E 

where the product of length 2c and the strain g/E is 
the maximum displacement between the faces of the 
crack. This displacement will depend on the number 
of dislocations forming the cleavage wedge and may 
be interpreted as a pile-up of n edge dislocations, each 
of Burgers vector a , so that equation (8.23) becomes 

o{na) = y (8.25) 

and gives a general criterion for fracture. Physically, 
this means that a number of glide dislocations, n, run 
together and in doing so cause the applied stress acting 
on them to do some work, which for fracture to occur 
must be at least sufficient to supply the energy to create 
the new cracked faces, i.e. (y + y p ). 

Qualitatively, we would expect those factors which 
influence the yield stress also to have an effect on the 
ductile -brittle fracture transition. The lattice ‘friction’ 
term g x , dislocation locking term k y , and grain size 2d 
should also all be important because any increase in 
Gi and k y , or decrease in the grain size, will raise the 
yield stress with a corresponding tendency to promote 
brittle failure. 

These conclusions have been put on a quantitative 
basis by Cottrell, who considered the stress needed 
to grow a crack at or near the tensile yield stress, 
G y , in specimens of grain diameter, 2d. Let us con- 
sider first the formation of a micro-crack. If r y is the 
actual shear stress operating, the effective shear stress 
acting on a glide band is only (r y — ri), where, it 
will be remembered, is the ‘friction’ stress resist- 
ing the motion of unlocked dislocations arising from 
the Peierls-Nabarro lattice stress, intersecting disloca- 
tions or groups of impurities. The displacement na is 
then given by 

na = [(r y — v x )/ fi\d (8.26) 

where /x is the shear modulus and d is the length 
of the slip band containing the dislocations (assumed 


here to be half the grain diameter). Once a microcrack 
is formed, however, the whole applied tensile stress 
normal to the crack acts on it, so that g can be written 
as (r y x constant), where the constant is included to 
account for the ratio of normal stress to shear stress. 
Substituting for na and g in the Griffith formula 
(equation (8.25)) then fracture should be able to occur 
at the yield point when g = <r y and 

r y (r y — Ti )d = C/xy (8.27) 

where C is a constant. The importance of the avalanche 
of dislocations produced at the yield drop can be seen if 
we replace r y by (constant x <j y ), by (constant x of) 
and use the Petch relationship <r y = 0 [ + k y d~ l/1 , when 
equation (8.27) becomes 

(<Xjd 1/2 + k y )ky = Pfj,y (8.28) 

where /3 is a constant which depends on the stress 
system; p ~ 1 for tensile deformation and p ~ | for a 
notched test. 

This is a general equation for the propagation of a 
crack at the lower yield point and shows what factors 
are likely to influence the fracture process. Alternative 
models for growth-controlled cleavage fracture have 
been developed to incorporate the possibility of carbide 
particles nucleating cracks. Such models emphasize 
the importance of yield parameters and grain size as 
well as carbide thickness. Coarse carbides give rise to 
low fracture stresses, thin carbides with high fracture 
stresses and ductile behaviour. 

8.4.4 Factors affecting brittleness of steels 

Many of the effects of alloying, heat treatment, and 
condition of testing on brittle fracture can be rational- 
ized on the basis of the above ‘transition’ equation. 

Ductile -brittle transition Under conditions where 
the value of the left-hand side of equation (8.28) is less 
than the value of the right-hand side, ductile behaviour 
should be observed; when the left-hand side exceeds 
the right-hand side the behaviour should be brittle. 
Since the right-hand side of equation (8.28) varies only 
slowly with temperature, it is the way in which changes 
occur in values of the terms on the left of the equation 
which are important in determining the ductile -brittle 
transition. Thus, in a given material brittleness should 
be favoured by low temperatures and high strain-rate, 
because these give rise to large values of g \ and k y , 
and by large grain sizes. On the right-hand side, the 
typical effect of a sharp notch is to raise the transition 
temperature of structural steel from around 100 K for 
a normal tensile test into the range of 200-300 K, 
because the value of p is lowered. 

Effects of composition and grain size At a con- 
stant temperature, because the values of Gi and k y 
remain fixed, the transition point will occur at a critical 
grain size above which the metal is brittle and below 
which it is ductile. 
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Figure 8.34 Influence of hydrogen on fracture behaviour showing (a) time -dependence and (b) temperature-dependence. 


The inclusion in equation (8.28) of the grain-size 
term, d 1/2 , in combination with the Oi term, enables 
many previous metallurgical misunderstandings to be 
cleared up. It shows that there is no simple connection 
between hardness and brittleness, since hardening pro- 
duced by refining the grain size reduces the brittleness, 
whereas hardening due to an increase in <Ti increases 
the brittleness. 

Heat treatment is generally used to control the grain 
size of the sample and refine the structure. ‘Killed’ 
steel has very good notch toughness, because alu- 
minium additions refine the grain size. Manganese 
reduces the grain size and by combining with car- 
bon also reduces the k y value so that this addition 
is especially beneficial in improving low-temperature 
ductility. It is fairly evident that an improved notch 
toughness steel, compared with that used for welded 
ships in World War II, is given by increasing the man- 
ganese content and decreasing the carbon content, i.e. a 
high manganese-to-carbon ratio. Other additions, par- 
ticularly nickel and chromium, have a similar effect on 
low-temperature ductility. 

The Group 6A metals (Cr, Mo and W) are more 
susceptible to brittle fracture than the Group 5A met- 
als (V, Nb and Ta). A comparison of these metals in 
terms of cleavage fracture is difficult, however, since 
Cr, Mo and W are susceptible to grain boundary frac- 
ture because segregation of impurities to such regions 
reduces the effective surface energy y. However, even 
if this effect is eliminated by lowering the impurity 
level, it appears that Ta, Nb and V are more ductile 
than Fe, Mo, Cr and W, presumably because they have 
a lower k y /fi ratio, and a higher y value. 

Work-hardening and irradiation-hardening Small 
amounts of plastic deformation at room temperature, 
which overcomes the yield point and unlocks some of 
the dislocations, improves the ductility at low temper- 
atures. The room- temperature ductility of chromium is 
similarly affected by small amounts of plastic deforma- 
tion at 400° C. In general, however, plastic deformation 
which leads to work-hardening embrittles the metal 


because it raises the 01 contribution, due to the forma- 
tion of intersecting dislocations, vacancy aggregates 
and other lattice defects. 

The importance of twins in fracture is not clear as 
there are several mechanisms other than twinning for 
the formation of a crack which can initiate fracture, 
and there is good evidence that micro-cracks form in 
steel in the absence of twins, and that cracks start 
at inclusions. Nevertheless, twinning and cleavage are 
generally found under similar conditions of tempera- 
ture and strain-rate in bcc transition metals, probably 
because both phenomena occur at high stress levels. 
The nucleation of a twin requires a higher stress than 
the propagation of the twin interface. 

Irradiation-hardening also embrittles the metal. 
According to the theory of this type of hardening 
outlined in Chapter 7, radiation damage can produce 
an increase in both k y (migration to dislocations 
of vacancies or interstitials) and (formation of 
dislocation loops and other aggregated defects). 
However, for steel, radiation-hardening is principally 
due to an increased contribution, presumably 
because the dislocations in mild steel are already too 
heavily locked with carbon atoms for any change 
in the structure of the dislocation to make any 
appreciable difference to k y . Nevertheless, a neutron 
dose of 1.9 x 10 23 n m 2 will render a typical fine- 
grained, unnotched mild steel, which is normally 
ductile at — 196°C, quite brittle. Moreover, experiments 
on notched fine-grained steel samples (see Figure 7.1c) 
show that this dose increases the ductile -brittle 
transition temperature by 65 °C. 

Micro structure The change in orientation at indi- 
vidual grain boundaries impedes the propagation of 
the cleavage crack by (1) creating cleavage steps, 
(2) causing localized deformation, and (3) tearing near 
the grain boundary. It is the extra work done (y p ) in 
such processes which increases the apparent surface 
energy y to (y s + y p ). It follows, therefore, that the 
smaller the distance a crack is able to propagate with- 
out being deviated by a change of orientation of the 
cleavage plane, the greater is the resistance to brittle 



Strengthening and toughening 291 


fracture. In this respect, the coarser high-temperature 
products of steel, such as pearlite and upper bainite, 
have inferior fracture characteristics compared with 
the finer lower bainite and martensite products. The 
fact that coarse carbides promote cleavage while fine 
carbides lead to ductile behaviour has already been 
discussed. 

8.4.5 Hydrogen embrittlement of steels 

It is well known that ferritic and martensitic steels 
are severely embrittled by small amounts of hydrogen. 
The hydrogen may be introduced during melting and 
retained during the solidification of massive steel cast- 
ings. Plating operations (e.g. Cd plating of steel for 
aircraft parts) may also lead to hydrogen embrittle- 
ment. Hydrogen can also be introduced during acid 
pickling or welding, or by exposure to H 2 S atmo- 
spheres. 

The chief characteristics of hydrogen embrittle- 
ment are its (1) strain-rate sensitivity, (2) temperature- 
dependence and (3) susceptibility to produce delayed 
fracture (see Figure 8.34). Unlike normal brittle frac- 
ture, hydrogen embrittlement is enhanced by slow 
strain-rates and consequently, notched-impact tests 
have little significance in detecting this type of embrit- 
tlement. Moreover, the phenomenon is not more com- 
mon at low temperatures, but is most severe in some 
intermediate temperature range around room tempera- 
ture (i.e. — 100°C to 100°C). These effects have been 
taken to indicate that hydrogen must be present in the 
material and must have a high mobility in order to 
cause embrittlement in poly crystalline aggregates. 

A commonly held concept of hydrogen embrittle- 
ment is that monatomic hydrogen precipitates at inter- 
nal voids or cracks as molecular hydrogen, so that as 
the pressure builds up it produces fracture. Alterna- 
tively, it has been proposed that the critical factor is 
the segregation of hydrogen, under applied stress, to 
regions of triaxial stress just ahead of the tip of the 
crack, and when a critical hydrogen concentration is 
obtained, a small crack grows and links up with the 
main crack. Hydrogen may also exist in the void or 
crack but it is considered that this has little effect on 
the fracture behaviour, and it is only the hydrogen in 
the stressed region that causes embrittlement. Neither 
model considers the Griffith criterion, which must be 
satisfied if cracks are to continue spreading. 

An application of the fracture theory may be made to 
this problem. Thus, if hydrogen collects in microcracks 
and exerts internal pressure P, the pressure may be 
directly added to the external stress to produce a total 
stress (P + p ) for propagation. Thus the crack will 
spread when 

(P + p)na = (y s + y p ) (8.29) 

where the surface energy is made up from a true sur- 
face energy y s and a plastic work term y p . The possi- 
bility that hydrogen causes embrittlement by becoming 
adsorbed on the crack surfaces thereby lowering y is 


thought to be small, since the plastic work term y p 
is the major term controlling y, whereas adsorption 
would mainly effect y s . 

Supersaturated hydrogen atoms precipitate as 
molecular hydrogen gas at a crack nucleus, or the 
interface between non-metallic inclusions and the 
matrix. The stresses from the build-up of hydrogen 
pressure are then relieved by the formation of 
small cleavage cracks. Clearly, while the crack is 
propagating, an insignificant amount of hydrogen will 
diffuse to the crack and, as a consequence, the pressure 
inside the crack will drop. However, because the length 
of the crack has increased, if a sufficiently large and 
constant stress is applied, the Griffith criterion will 
still be satisfied and completely brittle fracture can, 
in theory, occur. Thus, in iron single crystals, the 
presence or absence of hydrogen appears to have 
little effect during crack propagation because the crack 
has little difficulty spreading through the crystal. In 
poly crystalline material, however, the hydrogen must 
be both present and mobile, since propagation occurs 
during tensile straining. 

When a sufficiently large tensile stress is applied 
such that (p + P) is greater than that required by the 
Griffith criterion, the largest and sharpest crack will 
start to propagate, but will eventually be stopped at a 
micro structural feature, such as a grain boundary, as 
previously discussed. The pressure in the crack will 
then be less than in adjacent cracks which have not 
been able to propagate. A concentration gradient will 
then exist between such cracks (since the concentra- 
tion is proportional to the square root of the pressure 
of hydrogen) which provides a driving force for dif- 
fusion, so that the hydrogen pressure in the enlarged 
crack begins to increase again. The stress to propa- 
gate the crack decreases with increase in length of 
crack, and since p is increased by straining, a smaller 
increment AP of pressure may be sufficient to get the 
crack restarted. The process of crack propagation fol- 
lowed by a delay time for pressure build-up continues 
with straining until the specimen fails when the area 
between the cracks can no longer support the applied 
load. In higher strain-rate tests the hydrogen is unable 
to diffuse from one stopped crack to another to help 
the larger crack get started before it becomes blunted 
by plastic deformation at the tip. The decrease in the 
susceptibility to hydrogen embrittlement in specimens 
tested at low temperatures results from the lower pres- 
sure build-up at these temperature since PV = 3 nRT, 
and also because hydrogen has a lower mobility. 

8.4.6 Intergranular fracture 

Intergranular brittle failures are often regarded as a 
special class of fracture. In many alloys, however, 
there is a delicate balance between the stress required 
to cause a crack to propagate by cleavage and that 
needed to cause brittle separation along grain bound- 
aries. Although the energy absorbed in crack propa- 
gation may be low compared to cleavage fractures, 



292 Modern Physical Metallurgy and Materials Engineering 


much of the analysis of cleavage is still applica- 
ble if it is considered that chemical segregation to 
grain boundaries or crack faces lowers the surface 
energy y of the material. Fractures at low stresses are 
observed in austenitic chromium -nickel steels, due to 
the embrittling effect of intergranular carbide precipi- 
tation at grain boundaries. High transition temperatures 
and low fracture stresses are also common in tungsten 
and molybdenum as a result of the formation of thin 
second-phase films due to small amounts of oxygen, 
nitrogen or carbon. Similar behaviour is observed in 
the embrittlement of copper by antimony and iron by 
oxygen, although in some cases the second-phase films 
cannot be detected. 

A special intergranular failure, known as temper 
embrittlement, occurs in some alloy steels when tem- 
pered in the range 5 00 - 600° C. This phenomenon is 
associated with the segregation of certain elements or 
combinations of elements to the grain boundaries. The 
amount segregated is very small (~ a monolayer) but 
the species and amount has been identified by AES on 
specimens fractured intergranularly within the ultra- 
high vacuum of the Auger system. Group VIB ele- 
ments are known to be the most surface- active in iron 
but, fortunately, they combine readily with Mn and Cr 
thereby effectively reducing their solubility. Elements 
in Groups IVB and VB are less surface-active but often 
co-segregate in the boundaries with Ni and Mn. In 
Ni-Cr steels, the co-segregation of Ni-P and Ni-Sb 
occurs, but Mo additions can reduce the tendency for 
temper embrittlement. Since carbides are often present 
in the grain boundaries, these can provide the crack 
nucleus under the stress concentration from disloca- 
tion pile-ups either by cracking or by decohesion of 
the ferrite/carbide interface, particularly if the interfa- 
cial energy has been lowered by segregation. 


8.4.7 Ductile failure 

Ductile failure was introduced in Chapter 4 because 
of the role played by voids in the failure processes, 
which occurs by void nucleation, growth and coa- 
lescence. The nucleation of voids often takes place 
at inclusions. The dislocation structure around parti- 
cle inclusions leads to a local rate of work-hardening 
higher than the average and the local stress on reach- 
ing some critical value <r c will cause fracture of the 
inclusion or decohesion of the particle/matrix inter- 
face, thereby nucleating a void. The critical nucleation 
strain s n can be estimated and lies between 0.1 and 
1.0 depending on the model. For dispersion-hardening 
materials where dislocation loops are generated the 
stress on the interface due to the nearest prismatic 
loop, at distance r, is /ib/r, and this will cause sepa- 
ration of the interface when it reaches the theoretical 
strength of the interface, of order y w /b. The param- 
eter r is given in terms of the applied shear strain 
s, the particle diameter d and the length k equal to 
half the mean particle spacing as r = 4 kb /sd. Hence, 
void nucleation occurs on a particle of diameter d after 
a strain e, given by e = 4ky w / ptdb. Any stress con- 
centration effect from other loops will increase with 
particle size, thus enhancing the particle size depen- 
dence of strain to voiding. 

Once nucleated, the voids grow until they coalesce 
to provide an easy fracture path. A spherical-shaped 
void concentrates stress under tensile conditions and, 
as a result, elongates initially at about C(~2) times 
the rate of the specimen, but as it becomes ellipsoidal 
the growth-rate slows until finally the elongated void 
grows at about the same rate as the specimen. At 
some critical strain, the plasticity becomes localized 
and the voids rapidly coalesce and fracture occurs. The 
localization of the plasticity is thought to take place 
when the voids reach a critical distance of approach, 
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Figure 8.35 Schematic representation of ductile fracture, (a) Voids nucleate at inclusions, (b) voids elongate as the specimen 
extends, (c) voids coalesce to cause fracture when their length 2h is about equal to their separation (after Ashby et al., 1979). 




given when the void length 2 h is approximately equal 
to the separation, as shown in Figure 8.35. The true 
strain for coalescence is then 

e = (l/C)ln[a(2Z - 2r v )/2r v ] 

~(l/C)ln[a(l// v 1/2 )-l] (8.30) 

where a ~ 1 and / v is the volume fraction of inclu- 
sions. 

Void growth leading to failure will be much more 
rapid in the necked portion of a tensile sample follow- 
ing instability than during stable deformation, since the 
stress system changes in the neck from uniaxial tension 
to approximately plane strain tension. Thus the over- 
all ductility of a specimen will depend strongly on the 
macroscopic features of the stress -strain curves which 
(from Considered criterion) determines the extent of 
stable deformation, as well as on the ductile rupture 
process of void nucleation and growth. Nevertheless, 
an equation of the form of (8.30) reasonably describes 
the fracture strain for cup and cone failures. 

The work of decohesion influences the progress of 
voiding and is effective in determining the overall 
ductility in a simple tension test in two ways. The onset 
of voiding during uniform deformation depresses the 
rate of work-hardening which leads to a reduction in 
the uniform strain, and the void density and size at the 
onset of necking determines the amount of void growth 
required to cause ductile rupture. Thus for matrices 
having similar work-hardening properties, the one with 
the least tendency to ‘wet’ the second phase will show 
both lower uniform strain and lower necking strain. 
For matrices with different work-hardening potential 
but similar work of decohesion the matrix having 
the lower work-hardening rate will show the lower 
reduction prior to necking but the greater reduction 
during necking, although two materials will show 
similar total reductions to failure. 

The degree of bonding between particle and matrix 
may be determined from voids on particles annealed 
to produce an equilibrium configuration by measuring 
the contact angle 9 of the matrix surface to the particle 
surface. Resolving surface forces tangential to the 
particle, then the specific interface energy y\ is given 
approximately in terms of the matrix surface energy 
y m and the particle surface energy y P as y l = y P — 
y m cos 0. The work of separation of the interface y w is 
then given by 

Kw = yp + Km — Yi = Km(l +COS0) (8.31) 

Measurements show that the interfacial energy of TD 
nickel is low and hence exhibits excellent ductility 
at room temperature. Specific additions (e.g. Zr to 
TD nickel, and Co to Ni-Al 2 0 3 alloys) are also 
effective in lowering the interfacial energy, thereby 
causing the matrix to ‘wet’ the particle and increase the 
ductility. Because of their low y i, dispersion-hardened 
materials have superior mechanical properties at high 
temperatures compared with conventional hardened 
alloys. 
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Figure 8.36 Schematic representation of rupture with 
dynamic recrystallization (after Ashby et al., 1979). 

8.4.8 Rupture 

If the ductile failure mechanisms outlined above are 
inhibited then ductile rupture occurs (see Figure 8.36). 
Specimens deformed in tension ultimately reach a 
stage of mechanical instability when the deformation is 
localized either in a neck or in a shear band. With con- 
tinued straining the cross-section reduces to zero and 
the specimen ruptures, the strain-to-rupture depending 
on the amount of strain before and after localization. 
These strains are influenced by the work-hardening 
behaviour and strain-rate sensitivity. Clearly, rupture 
is favoured when void nucleation and/or growth is 
inhibited. This will occur if (1) second-phase particles 
are removed by zone -refining or dissolution at high 
temperatures, (2) the matrix/particle interface is strong 
and £ n is high, (3) the stress state minimizes plas- 
tic constraint and plane strain conditions (e.g. single 
crystals and thin sheets), (4) the work-hardening rate 
and strain-rate sensitivity is high as for superplastic 
materials (in some superplastic materials voids do not 
form but in many others they do and it is the growth 
and coalescence processes which are suppressed), and 
(5) there is stress relief at particles by recovery or 
dynamic recrystallization. Rupture is observed in most 
fee materials, usually associated with dynamic recrys- 
tallization. 

8.4.9 Voiding and fracture at elevated 
temperatures 

Creep usually takes place above 0.3 T m with a rate 
given by s = Bcr n y where B and n are material param- 
eters, as discussed in Chapter 7. Under such condi- 
tions ductile failure of a transgranular nature, sim- 
ilar to the ductile failure found commonly at low 
temperatures, may occur, when voids nucleated at 
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inclusions within the grains grow during creep defor- 
mation and coalesce to produce fracture. However, 
because these three processes are occurring at T ~ 
0.3 T m , local recovery is taking place and this delays 
both the onset of void nucleation and void coales- 
cence. More commonly at lower stresses and longer 
times-to-fracture, intergranular rather than transgranu- 
lar fracture is observed. In this situation, grain bound- 
ary sliding leads to the formation of either wedge 
cracks or voids on those boundaries normal to the 
tensile axis, as shown schematically in Figure 8.37b. 
This arises because grain boundary sliding produces a 
higher local strain-rate on an inclusion in the bound- 
ary than in the body of the grain, i.e. £i oca i — Kfd/2r) 
where / ~ 0.3 is the fraction of the overall strain due 
to sliding. The local strain therefore reaches the crit- 
ical nucleation strain s n much earlier than inside the 
grain. 

The time-to-fracture f f is observed to be a (l/e ss ), 
which confirms that fracture is controlled by power- 
law creep even though the rounded- shape of grain 
boundary voids indicates that local diffusion must con- 
tribute to the growth of the voids. One possibility is 
that the void nucleation, even in the boundary, occu- 
pies a major fraction of the lifetime tf, but a more 
likely general explanation is that the nucleated voids 
or cracks grow by local diffusion controlled by creep in 
the surrounding grains. Figure 8.37c shows the voids 
growing by diffusion, but between the voids the mate- 
rial is deforming by power-law creep, since the dif- 
fusion fields of neighbouring voids do not overlap. 
Void growth therefore depends on coupled diffusion 
and power-law creep, with the creep deformation con- 
trolling the rate of cavity growth. It is now believed 


that most intergranular creep fractures are governed by 
this type of mechanism. 

At very low stresses and high temperatures where 
diffusion is rapid and power-law creep negligible, the 
diffusion fields of the growing voids overlap. Under 
these conditions, the grain boundary voids are able to 
grow entirely by boundary diffusion; void coalescence 
then leads to fracture by a process of creep cavita- 
tion (Figure 8.38). In uniaxial tension the driving force 
arises from the process of taking atoms from the void 
surface and depositing them on the face of the grain 
that is almost perpendicular to the tensile axis, so that 
the specimen elongates in the direction of the stress 
and work is done. The vacancy concentration near the 
tensile boundary is c 0 exp(cr£2/kr) and near the void of 
radius r is c 0 exp(2y^/rkT), as discussed previously 
in Chapter 7, where £2 is the atomic volume and y the 
surface energy per unit area of the void. Thus vacan- 
cies flow usually by grain boundary diffusion from the 
boundaries to the voids when a > 2 y/r, i.e. when the 
chemical potential difference (oT2 — 2 yQ/r) between 
the two sites is negative. For a void r — 10 -6 m and 
y ~ 1 J/m 2 the minimum stress for hole growth is 
~2 MN/m 2 . In spite of being pure diffusional con- 
trolled growth, the voids may not always maintain their 
equilibrium near- spherical shape. Rapid surface diffu- 
sion is required to keep the balance between growth 
rate and surface redistribution and with increasing 
stress the voids become somewhat flattened. 

8.4.10 Fracture mechanism maps 

The fracture behaviour of a metal or alloy in different 
stress and temperature regimes can be summarized 
conveniently by displaying the dominant mechanisms 



Figure 8.37 Intergranular, creep-controlled, fracture. Voids nucleated by grain boundary sliding (a) and (b) growth by 
diffusion in (c) (after Ashby et al., 1979). 
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Figure 8.38 Voids lying on ‘ tensile ’ grain boundaries (a) 
grow by grain boundary diffusion (b) (after Ashby et al., 
1979). 

on a fracture mechanism map. Seven mechanisms have 
been identified, three for brittle behaviour including 
cleavage and intergranular brittle fracture, and four 
ductile processes. Figure 8.39 shows schematic maps 
for fee and bcc materials, respectively. Not all the frac- 
ture regimes are exhibited by fee materials, and even 
some of the ductile processes can be inhibited by alter- 
ing the metallurgical variables. For example, intergran- 
ular creep fracture is absent in high-purity aluminium 
but occurs in commercial-purity material, and because 
the dispersoid suppresses dynamic recrystallization in 
TD nickel, rupture does not take place whereas it does 
in Nimonic alloys at temperatures where the /' and 
carbides dissolve. 

In the bcc metals, brittle behaviour is separated into 
three fields; a brittle failure from a pre-existing crack, 
well below general yield, is called either cleavage 
1 or brittle intergranular fracture BIF1, depending 
on the fracture path. An almost totally brittle failure 
from a crack nucleated by slip or twinning, below 
general yield, is called either cleavage 2 or BIF2, and 


a cleavage or brittle boundary failure after general 
yield and with measurable strain-to-failure is called 
either cleavage 3 or BIF3. In many cases, mixed 
transgranular and intergranular fractures are observed, 
as a result of small changes in impurity content, texture 
or temperature which cause the crack to deviate from 
one path to another, no distinction is then made in 
the regime between cleavage and BIF. While maps 
for only two structures are shown in Figure 8.39 it is 
evident that as the bonding changes from metallic to 
ionic and covalent the fracture-mechanism fields will 
move from left to right: refractory oxides and silicates, 
for example, exhibit only the three brittle regimes and 
intergranular creep fracture. 

8.4.11 Crack growth under fatigue conditions 

Engineering structures such as bridges, pressure ves- 
sels and oil rigs all contain cracks and it is necessary 
to assess the safe life of the structure, i.e. the num- 
ber of stress cycles the structure will sustain before a 
crack grows to a critical length and propagates catas- 
trophically. The most effective approach to this prob- 
lem is by the use of fracture mechanics. Under static 
stress conditions, the state of stress near a crack tip is 
described by K , the stress intensity factor, but in cyclic 
loading K varies over a range A K(= K max — K m in ). 
The cyclic stress intensity A K increases with time at 
constant load, as shown in Figure 8.40a, because the 
crack grows. Moreover, for a crack of length a the rate 
of crack growth ( da/dN ) in /xm per cycle varies with 
A K according to the Paris -Erdogan equation 

da/dN = C(AK) m (8.32) 

where C and m are constants, with m between 2 and 
4. A typical crack growth rate curve is shown in 
Figure 8.40b and exhibits the expected linear relation- 
ship over part of the range. The upper limit corre- 
sponds to Kic, the fracture toughness of the material, 
and the lower limit of A K is called the threshold for 


in 

D 

D 

s 

E 


10 


“2 


10 


-4 


10 



rupture 


TIT 


TIT 


(a) 


<b) 


Figure 8.39 Schematic fracture mechanism maps for (a) fee and (b) bcc materials. 
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Figure 8.40 (a) Increase in stress intensity A K during fatigue; (b) variation of crack growth rate with increasing A K. 


crack growth (AK th ). Clearly, when the stress intensity 
factor is less than A K th the crack will not propagate at 
that particular stress and temperature, and hence A K th 
is of significance in design criteria. If the initial crack 
length is a 0 and the critical length a c , then the number 
of cycles to catastrophic failure will be given by 

ra c 

N f = da/C(AK) m 

JaO 

= [ C da/C[Aa^0*aj] m (8.33) 

JaO 

The mean stress level is known to affect the fatigue 
life and therefore da/dN. If the mean stress level 
is increased for a constant value of A K, K mSLX will 
increase and thus as K max approaches K {c the value 
of da/dN increases rapidly in practice, despite the 
constant value of A K. 

A survey of fatigue fractures indicates there are four 
general crack growth mechanisms: (1) striation forma- 
tion, (2) cleavage, (3) void coalescence and (4) inter- 
granular separation; some of these mechanisms have 
been discussed in Chapter 7. The crack growth 
behaviour shown in Figure 8.40b can be divided into 
three regimes which exhibit different fracture mecha- 
nisms. In regime A, there is a considerable influence 
of microstructure, mean stress and environment on 
the crack growth rate. In regime B, failure generally 
occurs, particularly in steels, by a transgranular ductile 
striation mechanism and there is often little influence 
of microstructure, mean stress or mild environments on 
crack growth. The degree of plastic constraint which 
varies with specimen thickness also appears to have 


little effect. At higher growth rate exhibited in regime 
C, the growth rates become extremely sensitive to both 
micro structure and mean stress, with a change from 
striation formation to fracture modes normally asso- 
ciated with noncyclic deformation, including cleavage 
and intergranular fracture. 
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Chapter 9 


Modern alloy developments 


9.1 Introduction 

In this chapter we will outline some of the devel- 
opments and properties of modern metallic alloys. 
Crucial to these materials have been the significant 
developments that have taken place in manufacturing, 
made possible by a more detailed understanding of 
the manufacturing process itself and of the behaviour 
of the material during both processing and in-service 
performance. Casting techniques in particular have 
advanced much over the past decade and now pro- 
vide reliable clean material with precision. Process 
modelling is developing to the extent that the process 
designer is able to take the micro structural specifi- 
cation for a given composition, which controls the 
properties of the material, and define an optimum man- 
ufacturing route to provide the desired material and 
performance. Modern alloys therefore depend on the 
proper integration of alloy composition and structure 
with processing to produce the desired properties and 
performance. 


9.2 Commercial steels 

9.2.1 Plain carbon steels 

Carbon is an effective, cheap, hardening element for 
iron and hence a large tonnage of commercial 
steels contains very little alloying element. They 
may be divided conveniently into low-carbon 
(<0.3% C), medium-carbon (0.3 -0.7% C) and high- 
carbon (0.7- 1.7% C). Figure 9.1 shows the effect 
of carbon on the strength and ductility. The low- 
carbon steels combine moderate strength with excellent 
ductility and are used extensively for their fabrication 
properties in the annealed or normalized condition 
for structural purposes, i.e. bridges, buildings, cars 
and ships. Even above about 0.2% C, however, the 
ductility is limiting for deep-drawing operations, and 
brittle fracture becomes a problem, particularly for 



Figure 9.1 Influence of carbon content on the strength and 
ductility of steel. 

welded thick sections. Improved low-carbon steels 
(<0.2% C) are produced by deoxidizing or ‘killing’ 
the steel with A1 or Si, or by adding Mn to refine 
the grain size. It is now more common, however, to 
add small amounts (<0.1%) of Nb which reduces 
the carbon content by forming NbC particles. These 
particles not only restrict grain growth but also give 
rise to strengthening by precipitation-hardening within 
the ferrite grains. Other carbide formers, such as Ti, 
may be used but because Nb does not deoxidize, it is 
possible to produce a semi-killed steel ingot which, 
because of its reduced ingot pipe, gives increased 
tonnage yield per ingot cast. 

Medium-carbon steels are capable of being quenched 
to form martensite and tempered to develop toughness 
with good strength. Tempering in higher- temperature 
regions (i.e. 350-550°C) produces a spheroidized car- 
bide which toughens the steel sufficiently for use as 
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axles, shafts, gears and rails. The process of ausform- 
ing can be applied to steels with this carbon content 
to produce even higher strengths without significantly 
reducing the ductility. The high-carbon steels are usu- 
ally quench hardened and lightly tempered at 250°C 
to develop considerable strength with sufficient ductil- 
ity for springs, dies and cutting tools. Their limitations 
stem from their poor hardenability and their rapid soft- 
ening properties at moderate tempering temperatures. 

9.2.2 Alloy steels 

In low/medium alloy steels, with total alloying con- 
tent up to about 5%, the alloy content is governed 
largely by the hardenability and tempering require- 
ments, although solid solution hardening and car- 
bide formation may also be important. Some of these 
aspects have already been discussed, the main con- 
clusions being that Mn and Cr increase hardenabil- 
ity and generally retard softening and tempering; Ni 
strengthens the ferrite and improves hardenability and 
toughness; copper behaves similarly but also retards 
tempering; Co strengthens ferrite and retards soften- 
ing on tempering; Si retards and reduces the volume 
change to martensite, and both Mo and V retard tem- 
pering and provide secondary hardening. 

In larger amounts, alloying elements either open 
up the austenite phase field, as shown in Figure 9.2a, 
or close the y-field (Figure 9.2b). ‘Full’ metals with 
atoms like hard spheres (e.g. Mn, Co, Ni) favour close- 
packed structures and open the y-field, whereas the 


stable bcc transition metals (e.g. Ti, V, Cr, Mo) close 
the field and form what is called a y-loop. The develop- 
ment of austenitic steels, an important class of ferrous 
alloys, is dependent on the opening of the y-phase 
field. The most common element added to iron to 
achieve this effect is Ni, as shown in Figure 9.2a. From 
this diagram the equilibrium phases at lower temper- 
atures for alloys containing 4-40% Ni are ferrite and 
austenite. In practice, it turns out that it is unnecessary 
to add the quantity of Ni to reach the y-phase boundary 
at room temperature, since small additions of other ele- 
ments tend to depress the y/a transformation tempera- 
ture range so making the y metastable at room temper- 
ature. Interstitial C and N, which most ferrous alloys 
contain, also expand the y-field because there are 
larger interstices in the fee than the bcc structure. The 
other common element which expands the y-field is 
Mn. Small amounts (<1%) are usually present in most 
commercial steels to reduce the harmful effect of FeS. 
Up to 2% Mn may be added to replace the more expen- 
sive Ni, but additions in excess of this concentration 
have little commercial significance until 12% Mn is 
reached. Hadfield’s steel contains 12-14% Mn, 1% C, 
is noted for its toughness and its used in railway points, 
drilling machines and rock-crushers. The steel is water- 
quenched to produce austenite. The fee structure has 
good fracture resistance and, having a low stacking- 
fault energy, work-hardens very rapidly. During the 
abrasion and work-hardening the hardening is further 
intensified by a partial strain transformation of the 
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Figure 9.2 Effect of (a) Ni and (b) Cr on y-field (from Smithells, 1967). 
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austenite to martensite; this principle is used also in 
the sheet-forming of stainless steels (see below). 

To make the austenitic steels resistant to oxida- 
tion and corrosion (see Chapter 12) the element Cr 
is usually added in concentrations greater than 12%. 
Chromium closes the y-field, however, and with very 
low carbon contents single-phase austenite cannot 
be produced with the stainless (>12%) composition. 
These alloys form the stainless (ferritic) irons and 
are easily fabricated for use as furnace components. 
Increasing the carbon content expands the y-loop 
and in the medium-carbon range Cr contents with 
good stainless qualities (~15-18%) can be quench- 
hardened for cutlery purposes where martensite is 
required to give a hard, sharp cutting edge. The com- 
bination of both Cr and Ni (i.e. 18/8) produces the 
metastable austenitic stainless steel which is used in 
chemical plant construction, kitchenware and surgi- 
cal instruments because of its ductility, toughness and 
cold- working properties. Metastable austenitic steels 
have good press-forming properties because the strain- 
induced transformation to martensite provides an addi- 
tional strengthening mechanism to work-hardening, 
and moreover counteracts any drawing instability by 
forming martensite in the locally-thinned, heavily- 
deformed regions. 

High-strength transformable stainless steels with 
good weldability to allow fabrication of aircraft and 
engine components have been developed from the 
0.05-0.1% C, 12% Cr, stainless steels by secondary 
hardening addition (1.5-2% Mo; 0.3-0.5% V). Small 
additions of Ni or Mn (2%) are also added to coun- 
teract the ferrite-forming elements Mo and V to make 
the steel fully austenitic at the high temperatures. Air 
quenching to give a followed by tempering at 650°C 
to precipitate Mo 2 C produces a steel with high yield 
strength (0.75 GN/m 2 ), high TS (1.03 GN/m 2 ) and 
good elongation and impact properties. Even higher 
strengths can be achieved with stainless (12-16% Cr; 
0.05% C) steels which although austenitic at room 
temperature (5% Ni, 2% Mn) transform on cooling to 
— 78°C. The steel is easily fabricated at room temper- 
ature, cooled to control the transformation and finally 
tempered at 650- 700° C to precipitate Mo 2 C. 

9.2.3 Maraging steels 

A serious limitation in producing high-strength steels 
is the associated reduction in fracture toughness. Car- 
bon is one of the elements which mostly affects the 
toughness and hence in alloy steels it is reduced 
to as low a level as possible, consistent with good 
strength. Developments in the technology of high-alloy 
steels have produced high strengths in steels with very 
low carbon contents (<0.03%) by a combination of 
martensite and age-hardening, called maraging. The 
maraging steels are based on an Fe-Ni containing 
between 18% and 25% Ni to produce massive marten- 
site on air cooling to room temperature. Additional 


hardening of the martensite is achieved by precipita- 
tion of various intermetallic compounds, principally 
Ni 3 Mo or Ni 3 (Mo, Ti) brought about by the addition 
of roughly 5% Mo, 8% Co as well as small amounts 
of Ti and Al; the alloys are solution heat-treated at 
815°C and aged at about 485°C. Many substitutional 
elements can produce age-hardening in Fe-Ni marten- 
sites, some strong (Ti, Be), some moderate (Al, Nb, 
Mn, Mo, Si, Ta, V) and other weak (Co, Cu, Zr) hard- 
eners. There can, however, be rather strong interactions 
between elements such as Co and Mo, in that the hard- 
ening produced when these two elements are present 
together is much greater than if added individually. 
It is found that A 3 B-type compounds are favoured at 
high Ni or (Ni + Co) contents and A 2 B Eaves phases 
at lower contents. 

In the unaged condition maraging steels have a yield 
strength of about 0.7 GN/m 2 . On ageing this increases 
up to 2.0 GN/m 2 and the precipitation- strengthening is 
due to an Orowan mechanism according to the relation 
o = <7 0 + (afib/L) where <t 0 is the matrix strength, 
a a constant and L the interprecipitate spacing. The 
primary precipitation- strengthening effect arises from 
the (Co + Mo) combination, but Ti plays a double role 
as a supplementary hardener and a refining agent to 
tie up residual carbon. The alloys generally have good 
weldability, resistance to hydrogen embrittlement and 
stress-corrosion but are used mainly (particularly the 
18% Ni alloy) for their excellent combination of high 
strength and toughness. 

9.2.4 High-strength low-alloy (HSLA) steels 

The requirement for structural steels to be welded sat- 
isfactorily has led to steels with lower C (<0.1%) 
content. Unfortunately, lowering the C content reduces 
the strength and this has to be compensated for by 
refining the grain size. This is difficult to achieve with 
plain C-steels rolled in the austenite range but the 
addition of small amounts of strong carbide-forming 
elements (e.g. <0.1% Nb) causes the austenite bound- 
aries to be pinned by second-phase particles and fine- 
grain sizes (<10 pm) to be produced by controlled 
rolling. Nitrides and carbonitrides as well as carbides, 
predominantly fee and mutually soluble in each other, 
may feature as suitable grain refiners in HSEA steels; 
examples include AIN, Nb(CN), V(CN), (NbV)CN, 
TiC and Ti(CN). The solubility of these particles in the 
austenite decreases in the order VC, TiC, NbC while 
the nitrides, with generally lower solubility, decrease 
in solubility in the order VN, AIN, TiN and NbN. 
Because of the low solubility of NbC, Nb is per- 
haps the most effective grain size controller. However, 
Al, V and Ti are effective in high-nitrogen steels, Al 
because it forms only a nitride, V and Ti by forming 
V(CN) and Ti(CN) which are less soluble in austenite 
than either VC or TiC. 

The major strengthening mechanism in HSEA steels 
is grain refinement but the required strength level is 
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obtained usually by additional precipitation strength- 
ening in the ferrite. VC, for example, is more soluble 
in austenite than NbC, so if V and Nb are used in 
combination, then on transformation of the austen- 
ite to ferrite, NbC provides the grain refinement and 
VC precipitation strengthening; Figure 9.3 shows a 
stress -strain curve from a typical HSLA steel. 

Solid- solution strengthening of the ferrite is also 
possible. Phosphorus is normally regarded as deleteri- 
ous due to grain boundary segregation, but it is a pow- 
erful strengthener, second only to carbon. In car con- 
struction where the design pressure is for lighter bodies 
and energy saving, HSLA steels, rephosphorized and 
bake-hardened to increase the strength further, have 
allowed sheet gauges to be reduced by 10-15% while 
maintaining dent resistance. The bake-hardening arises 
from the locking of dislocations with interstitials, as 
discussed in Chapter 7, during the time at the temper- 
ature of the paint-baking stage of manufacture. 

9.2.5 Dual-phase (DP) steels 

Much research into the deformation behaviour of spe- 
ciality steels has been aimed at producing improved 
strength while maintaining good ductility. The con- 
ventional means of strengthening by grain refinement, 
solid- solution additions (Si, P, Mn) and precipitation- 
hardening by V, Nb or Ti carbides (or carbonitrides) 
have been extensively explored and a conventionally 
treated HSLA steel would have a lower yield stress 
of 550 MN m 2 , a TS of 620 MN m 2 and a total 
elongation of about 18%. In recent years an improved 
strength- ductility relationship has been found for low- 
carbon, low-alloy steels rapidly cooled from an anneal- 
ing temperature at which the steel consisted of a 
mixture of ferrite and austenite. Such steels have 
a microstructure containing principally low-carbon, 
fine-grained ferrite intermixed with islands of fine 
martensite and are known as dualphase steels. Typi- 
cal properties of this group of steels would be a TS 
of 620 MN m -2 , a 0.2% offset flow stress of 380 MN 
m -2 and a 3% offset flow stress of 480 MN m -2 with 
a total elongation ~28%. 

The implications of the improvement in mechan- 
ical properties are evident from an examination of 
the nominal stress-strain curves, shown in Figure 9.3. 
The dual-phase steel exhibits no yield discontinuity 
but work-hardens rapidly so as to be just as strong 
as the conventional HSLA steel when both have been 
deformed by about 5%. In contrast to ferrite -pearlite 
steels, the work-hardening rate of dual-phase steel 
increases as the strength increases. The absence of 
discontinuous yielding in dual-phase steels is an advan- 
tage during cold-pressing operations and this feature 
combined with the way in which they sustain work- 
hardening to high strains makes them attractive mate- 
rials for sheet-forming operations. The flow stress and 
tensile strength of dual-phase steels increase as the 



Figure 9.3 Stress-strain curves for plain carbon, HSLA and 
dual-phase steels. 

volume fraction of hard phase increases with a corre- 
sponding decrease in ductility; about 20% volume frac- 
tion of martensite produces the optimum properties. 

The dual phase is produced by annealing in the (a + 
y) region followed by cooling at a rate which ensures 
that the y-phase transforms to martensite, although 
some retained austenite is also usually present leading 
to a mixed martensite -austenite (M-A) constituent. 
To allow air-cooling after annealing, microalloying 
elements are added to low-carbon -manganese -silicon 
steel, particularly vanadium or molybdenum and chro- 
mium. Vanadium in solid solution in the austenite 
increases the hardenability but the enhanced harden- 
ability is due mainly to the presence of fine carboni- 
tride precipitates which are unlikely to dissolve in 
either the austenite or the ferrite at the temperatures 
employed and thus inhibit the movement of the austen- 
ite/ferrite interface during the post-anneal cooling. 

The martensite structure found in dual-phase steels 
is characteristic of plate martensite having internal 
microtwins. The retained austenite can transform to 
martensite during straining thereby contributing to the 
increased strength and work-hardening. Interruption 
of the cooling, following intercritical annealing, can 
lead to stabilization of the austenite with an increased 
strength on subsequent deformation. The ferrite grains 
(~5 pm) adjacent to the martensite islands are gen- 
erally observed to have a high dislocation density 
resulting from the volume and shape change associ- 
ated with the austenite to martensite transformation. 
Dislocations are also usually evident around retained 
austenitic islands due to differential contraction of the 
ferrite and austenite during cooling. 

Some deformation models of DP steels assume both 
phases are ductile and obey the Ludwig relationship, 
with equal strain in both phases. Measurements by sev- 
eral workers have, however, clearly shown a partition- 
ing of strain between the martensite and ferrite, with 
the mixed (M-A) constituent exhibiting no strain until 
deformations well in excess of the maximum uniform 
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strain. Models based on the partitioning of strain pre- 
dict a linear relationship between yield stress, TS and 
volume fraction of martensite but a linear relationship 
is not sensitive to the model. An alternative approach 
is to consider the microstructure as approximating to 
that of a dispersion- strengthened alloy. This would be 
appropriate when the martensite does not deform and 
still be a good approximation when the strain differ- 
ence between the two phases is large. Such a model 
affords an explanation of the high work-hardening rate, 
as outlined in Chapter 7, arising from the interaction 
of the primary dislocations with the dense ‘tangle’ of 
dislocations generated in the matrix around the hard 
phase islands. 

Several workers have examined DP steels to deter- 
mine the effect of size and volume fraction of the 
hard phase. Figure 9.4 shows the results at two dif- 
ferent strain values and confirms the linear rela- 
tionship between work-hardening rate (da/d s) and 
(/ /d) l/2 predicted by the dispersion-hardening the- 
ory (see Chapter 7). Increasing the hard phase volume 
fraction while keeping the island diameter constant 
increases the work-hardening rate, increases the TS 
but decreases the elongation. At constant volume frac- 
tion of hard phase, decreasing the mean island diameter 
produces no effect on the tensile strength but increases 
the work-hardening rate and the maximum uniform 
elongation (Figure 9.5). Thus the strength is improved 
by increasing the volume fraction of hard phase 
while the work-hardening and ductility are improved 
by reducing the hard phase island size. Although 
dual-phase steels contain a complex micro structure 
it appears from their mechanical behaviour that they 
can be considered as agglomerates of non-deformable 
hard particles, made up of martensite and/or bainite 
and/or retained austenite, in a ductile matrix of ferrite. 
Consistent with the dispersion-strengthened model, the 
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Figure 9.4 Dependence of work-hardening rate on ( volume 
fraction f/particle size) 1 / 2 for a dual-phase steel at strain 
values of 0.2 and 0.25 (after Balliger and Gladman, 1981). 


Bauschinger effect, where the flow stress in compres- 
sion is less than that in tension, is rather large in dual- 
phase steels, as shown in Figure 9.6 and increases with 
increase in martensite content up to about 25%. The 
Bauschinger effect arises from the long-range back- 
stress exerted by the martensite islands, which add to 
the applied stress in reversed straining. 

The ferrite grain size can give significant strengthen- 
ing at small strains, but an increasing proportion of the 
strength arises from work-hardening and this is inde- 
pendent of grain- size changes from about 3 to 30 pm. 
Solid solution strengthening of the ferrite (e.g. by sil- 
icon) enhances the work-hardening rate; P, Mn and 
V are also beneficial. The absence of a sharp yield 
point must imply that the dual-phase steel contains a 
high density of mobile dislocations. The microstruc- 
ture exhibits such a dislocation density around the 
martensite islands but why these remain unpinned at 
ambient temperature is still in doubt, particularly as 
strain-ageing is significant on ageing between 423 and 
573 K. Intercritical annealing allows a partitioning of 
the carbon to produce very low carbon ferrite, while 
aluminium- or silicon- killed steels have limited nitro- 
gen remaining in solution. However, it is doubtful 
whether the concentration of interstitials is sufficiently 
low to prevent strain-ageing at low temperature; hence 
it is considered more likely that continuous yielding 
is due to the residual stress fields surrounding second- 
phase islands. Two possibilities then arise: (1) yielding 
can start in several regions at the same time rather 
than in one local region which initiates a general yield 
process catastrophically, and (2) any local region is 
prevented from yielding catastrophically because the 
glide band has to overcome a high back stress from 
the M-A islands. Discontinuous yielding on ageing 
at higher temperatures is then interpreted in terms of 
the relaxation of these residual stresses, followed by 
classical strain- ageing. 

In dual-phase steels the n value ~0.2 gives the 
high and sustained work-hardening rate required when 
stretch formability is the limiting factor in fabrica- 
tion. However, when fracture per se is limiting, dual- 
phase steels probably perform no better than other 
steels with controlled inclusion content. Tensile failure 
of dual-phase steels is initiated either by decohesion 
of the martensite -ferrite interface or by cracking of 
the martensite islands. Improved fracture behaviour is 
obtained when the martensite islands are unconnected, 
when the martensite -ferrite interface is free from pre- 
cipitates to act as stress raisers, and when the hard 
phase is relatively tough. The optimum martensite con- 
tent is considered to be 20%, because above this level 
void formation at hard islands increases markedly. 

9.2.6 Mechanically alloyed (MA) steels 

For strengthening at high temperatures, dispersion 
strengthening with oxide, nitride or carbide particles is 
an attractive possibility. Such dispersion-strengthened 
materials are usually produced by powder processing, 
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Figure 9.5 Effect of second phase particles size d at constant 
(c) tensile strength (after Balliger and Gladman, 1981). 



TOTAL STRAIN 

Figure 9.6 Bauschinger tests for a 0.06%C, 1.5%Mn, 
0.85%Si dual-phase steel (courtesy of D. V. Wilson). 


a special form of which is known as mechanical alloy- 
ing (MA). 

Mechanical alloying is a dry powder, high-energy 
ball-milling process in which the particles of elemen- 
tal or pre-alloyed powder are continuously welded 
together and broken apart until a homogeneous mix- 
ture of the matrix material and dispersoid is pro- 
duced. Mechanical alloying is not simply mixing on 
a fine scale but one in which true alloying occurs. 
The final product is then consolidated by a combina- 
tion of high temperature and pressure (i.e. extrusion of 
canned powder) or hot isostatic pressing (i.e. HIPing). 
Further processing is by thermo-mechanical process- 
ing (TMP) to produce either (1) fine equiaxed grains 
for good room-temperature strength and good fatigue 
strength or (2) coarser, elongated grains to give good 
high-temperature stress -rupture strength and thermal- 
fatigue resistance. 

Various types of ferrous alloy have been made 
by mechanical alloying, including 17%Cr, 7%Ni, 
1.2%A1 precipitation-hardened austenitic martensitic 


volume fraction f on (a) work-hardening rate, (b) elongation and 


steel and Fe-25Cr-6Al-2Y. However, the most 
highly developed material is the 20%Cr, 4.5 %A1 
ferritic stainless steel, dispersion- strengthened with 
0.5% Y 2 0 3 (j MA 956). MA 956 which has been 
made into various fabricated forms has extremely 
good high-temperature strength (0.2% proof strength 
is 200 MN m~ 2 at 600°C, 100 MN m“ 2 at 1000°C and 
75 MN m“ 2 at 1200°C). 

The high-strength capability is combined with exce- 
ptional high-temperature oxidation and corrosion resis- 
tance, associated with the formation of an aluminium 
oxide scale which is an excellent barrier to carbon. No 
carburization occurs in hydrogen -methane mixtures at 
1000°C. Sulphidation resistance is also good. 

MA 956 was originally developed for use in sheet 
form in gas-turbine combustors but, with its combi- 
nation of high strength up to 1300°C, corrosion resis- 
tance and formability, the alloy has found many other 
applications in power stations, including oil and coal 
burners and swirlers, and fabricated tube assemblies 
for fluid-bed combustion. 

9.2.7 Designation of steels 

The original system for labelling wrought steels was 
devised in 1941 and used En numbers. This system 
was replaced in 1976 by the British Standard (BS) 
designation of steels which uses a six-unit system. 
Essentially, it enables the code to express composi- 
tion, steel type and supply requirements. The latter 
is shown by three letters: M means supply to spec- 
ified mechanical properties, H supply to hardenabil- 
ity requirements and A supply to chemical analysis 
requirements. For convenience, steels are divided 
into types; namely, carbon and carbon -manganese 
steels, free-cutting steels, high-alloy steels and alloy 
steels. For example, carbon and carbon -manganese 
steels are designated by mean of Mn/letter/mean of 
C. Thus 080H41 signifies 0.6- 1.0 Mn/hardenability 
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requirement/0.38-0.45 C. Free-cutting steels are des- 
ignated by 200-240/letter/mean of C. Thus 225M44 
signifies free-cutting 0.2-0.3 S/mechanical properties 
requirement/0.4-0.48 C with 1.3 -1.7 Mn. High-alloy 
steels include stainless and valve steels. The desig- 
nation is similar to the AISI system and is given by 
300-499/letters/variants 11-19. Thus 304S15 (pre- 
viously known as Type 304 as used by the AISI) 
signifies 0.06 max. C, 8-11 Ni, 17.5-19 Cr. Alloy 
steels are designated by 500-999/letter/mean of C. 
Thus 500-519 are Ni steels, 520-539 Cr steels, 
630-659 Ni-Cr steels, 700-729 Cr-Mo steels and 
800-839 Ni-Cr-Mo steels. Typically 530M40 signi- 
fies 0.36-0.44 C, 0.9- 1.2 Cr, supplied to mechanical 
properties. 

Tables 9.1 and 9.2 give the compositions of typical 
carbon, alloy and stainless steels. 

9.3 Cast irons 

In the iron-carbon system (Chapter 3) carbon is ther- 
modynamically more stable as graphite than cementite. 

Table 9.1 Compositions of some carbon and alloy steels 


At the low carbon contents of typical steels, graphite 
is not formed, however, because of the sluggishness 
of the reaction to graphite. But when the carbon con- 
tent is increased to that typical of cast irons (2-4% C) 
either graphite or cementite may separate depending 
on the cooling rate, chemical (alloy) composition and 
heat treatment (see Figure 9.7). When the carbon exists 
as cementite, the cast irons are referred to as white 
because of the bright fracture produced by this brit- 
tle constituent. In grey cast irons the carbon exists 
as flakes of graphite embedded in the ferrite -pearlite 
matrix and these impart a dull grey appearance to the 
fracture. When both cementite and graphite are present 
a ‘mottled’ iron is produced. 

High cooling rates, which tend to stabilize the 
cementite, and the presence of carbide-formers give 
rise to white irons. The addition of graphite-forming 
elements (Si, Ni) produces grey irons, even when 
rapidly cooled if the Si is above 3%. These elements, 
particularly Si, alter the eutectic composition which 
may be taken into account by using the carbon equiv- 
alent of the cast iron, given by [total %C + (%Si + 


BS 

designation 

AISI-SAE 

number* 

% C 

% Si 

% Mn 

% Ni 

% Cr 

% others 

040A20 

1020 

0.18-0.23 


0.30-0.50 




080A62 

1060 

0.60-0.65 


0.70-0.90 




070A78 

1080 

0.75-0.82 


0.60-0.80 




150M36 

1340 

0.32-0.40 


1.30-1.70 




212M44 

1140 

0.40-0.48 

0.25 max 

1.00-1.40 



0.12-0.2 S 

527A19 

5120 

0.17-0.22 


0.70-0.90 


0.70-0.90 


665H20 

4620 

0.17-0.23 

0.10-0.35 

0.35-0.75 

1.50-2.00 


0.20-0.30 Mo 

708M40 

4140 

0.36-0.44 

0.10-0.35 

0.70-1.00 


0.90-1.20 

0.15-0.25 Mo 


* Approximately equivalent composition. 


Table 9.2 Compositions and properties of some stainless steels 


Steel 

BS 

designation 

% C 

% Cr 

% Ni 

Others 

Tensile 

strength 

(MN m~ 2 ) 

Yield 
strength 
(MN m~ 2 ) 

% 

elongation 

Condition 

Austenitic 









304 

304S15 

0.08 

18-20 

8.0-10.5 


517 

207 

30 

Annealed 







1276 

966 

9 

Cold-worked 

316 

316S16 

0.08 

16-18 

10-14 

2-3% Mo 

517 

207 

30 

Annealed 

321 

321S12 

0.08 

17-19 

9-12 

Ti (5 x % C) 

586 

241 

55 

Annealed 

347 

347S17 

0.08 

17-19 

9-13 

Nb (10 x % C) 

621 

241 

50 

Annealed 

Ferritic 










430 

430S15 

0.12 

16-18 



448 

207 

22 

Annealed 

Martensitic 









416 

416S21 

0.15 

12-14 


0.60% Mo 

1241 

966 

18 

Quenched 

431 

431S29 

0.20 

15-17 

1.25-2.50 


1379 

1035 

16 

and 










tempered 

Precipitation hardening 








17-4 


0.07 

16-18 

3-5 

0.15-0.45% Nb 

1310 

1172 

10 

Age- 

17-7 


0.09 

16-18 

6.5-7. 8 

0.75-1.25% A1 

1655 

1586 

6 

hardened 
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(b) 

Figure 9.7 Microstructure of cast irons: (a) white iron and 
(b) grey iron (400 x). (a) shows cementite (white) and 
pearlite; (b) shows graphite flakes, some ferrite (white) and 
a matrix of pearlite. 

%P)/3], rather than the true carbon content. Phospho- 
rus is present in most cast irons as a low melting 
point phosphide eutectic which improves the fluidity 
of the iron by lengthening the solidification period; this 
favours the decomposition of cementite. Grey cast iron 
is used for a wide variety of applications because of 
its good strength/cost ratio. It is easily cast into intri- 
cate shapes and has good machinability, since the chips 
break off easily at the graphite flakes. It also has a high 
damping capacity and hence is used for lathe and other 
machine frames where vibrations need to be damped 
out. The limited strength and ductility of grey cast 
iron may be improved by small additions of the car- 
bide formers (Cr, Mo) which reduce the flake size and 
refine the pearlite. The main use of white irons is as a 
starting material for malleable cast iron, in which the 
cementite in the casting is decomposed by annealing. 
Such irons contain sufficient Si (<1.3%) to promote 
the decomposition process during the heat-treatment 


but not enough to produce graphite flakes during cast- 
ing. White-heart malleable iron is made by heating the 
casting in an oxidizing environment (e.g. hematite iron 
ore at 900°C for 3-5 days). In thin sections the carbon 
is oxidized to ferrite, and in thick sections, ferrite at 
the outside gradually changes to graphite clusters in 
a ferrite -pearlite matrix near the inside. Black-heart 
malleable iron is made by annealing the white iron 
in a neutral packing (i.e. iron silicate slag) when the 
cementite is changed to rosette- shaped graphite nod- 
ules in a ferrite matrix. The deleterious cracking effect 
of the graphite flakes is removed by this process and a 
cast iron which combines the casting and machinabil- 
ity of grey iron with good strength and ductility, i.e. 
TS 350 MN m~ 2 and 5-15% elongation is produced. 
It is therefore used widely in engineering and agricul- 
ture where intricate shaped articles with good strength 
are required. 

Even better mechanical properties (550 MN m -2 ) 
can be achieved in cast irons, without destroying the 
excellent casting and machining properties, by the pro- 
duction of a spherulitic graphite. The spherulitic nod- 
ules are roughly spherical in shape and are composed 
of a number of graphite crystals, which grow radially 
from a common nucleus with their basal planes nor- 
mal to the radial growth axis. This form of growth 
habit is promoted in an as-cast grey iron by the addi- 
tion of small amounts of Mg or Ce to the molten 
metal in the ladle which changes the interfacial energy 
between the graphite and the liquid. Good strength, 
toughness and ductility can thus be obtained in cast- 
ings that are too thick in section for malleabilizing 
and can replace steel castings and forgings in certain 
applications. 

Heat-treating the ductile cast iron produces austem- 
pered ductile iron (ADI) with an excellent combination 
of strength, fracture toughness and wear resistance for 
a wide variety of applications in automotive, rail and 
heavy engineering industries. A typical composition 
is 3. 5-4.0% C, 2-2.5% Si, 0.03-0.06% Mg, 0.015% 
maximum S and 0.06% maximum P. Alloying ele- 
ments such as Cu and Ni may be added to enhance 
the heat-treatability. Heat-treatment of the cast ductile 
iron (graphite nodules in a ferrite matrix) consists of 
austenization at 950°C for 1-3 hours during which the 
matrix becomes fully austenitic, saturated with carbon 
as the nodules dissolve. The fully austenized casting 
is then quenched to around 350°C and austempered 
at this temperature for 1-3 hours. The austempering 
temperature is the most important parameter in deter- 
mining the mechanical properties of ADI; high austem- 
pering temperatures (i.e. 350-400°C) result in high 
ductility and toughness and lower yield and tensile 
strengths, whereas lower austempering temperatures 
(250-300°C) result in high yield and tensile strengths, 
high wear resistance and lower ductility and tough- 
ness. After austempering the casting is cooled to room 
temperature. 

The desired microstructure of ADI is acicular ferrite 
plus stable, high-carbon austenite, where the presence 
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Figure 9.8 Microstructure and fracture mode of silicon spheroidal graphite (SG) iron, (a) and (b) as -cast and (c) and 
(d) austempered at 3 50° C for 1 h (L. Sidjanin and R. E. Smallman, 1992; courtesy of Institute of Metals). 


of Si strongly retards the precipitation of carbides. 
When the casting is austempered for longer times 
than that to produce the desired structure, carbides 
are precipitated in the ferrite to produce bainite. Low 
austempering temperatures (~250°C) lead to cementite 
precipitation, but at the higher austempering temper- 
atures (3 00 - 400° C) transition carbides are formed, s 
carbides at the lower temperatures and 37 carbides at the 
higher. With long austempering times the high-carbon 
austenite precipitates ^-carbide at the ferrite -austenite 
boundaries. The formation of bainite does not result in 
any catastrophic change in properties but produces a 
gradual deterioration with increasing time of austem- 
pering. Typically, ADI will have a tensile strength of 
1200-1500 MN m -2 , an elongation of 6-10% and 
Ki c ~ 80 MN m _3/2 . With longer austempering the 
elongation drops to a few per cent and the K\ c reduces 


to 40 MN nrr 3/2 . The formation of ^-carbide at the 
ferrite -austenite boundaries must be avoided since this 
leads to more brittle fracture. Generally, the strength 
is related to the volume fraction of austenite and the 
ferrite spacing. Figure 9.8 shows the micro structure of 
Si spheroidal graphite (SG) iron and the corresponding 
fracture mode. 

9.4 Superalloys 

9.4.1 Basic alloying features 

These alloys have been developed for high-temperature 
service and include iron, cobalt and nickel-based 
materials, although nowadays they are principally 
nickel-based. The production of these alloys over sev- 
eral decades (see Figure 9.9) illustrates the transition 
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Table 9.3 Influence of various alloying additions in superalloys 


Influence 

Cr 

Al 

Ti 

Co 

Mo 

W 

B 

Zr 

C 

Nb 

W 

Ta 

Matrix strengthening 
/-formers 

V 

c 

V 

V 
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■j 




V 


V 

Carbide-formers 

V 
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-j 

V 




V 

V 

V 
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Oxide scale formers 

V 
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V 
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Figure 9.9 Increases in temperature capability for turbine 
blade alloys, based on creep rupture in 1000 h at 
150 MN m~ 2 (from Driver, 1985, pp. 345-54). 


in the development of engineering materials from basic 
alloy composition achievements to a more process- 
dominated control. 

In these alloys / (N 13 AI) and y* (Ni 3 Nb) are the 
principal strengtheners by chemical and coherency 
strain hardening. The ordered /-Ni 3 Al phase is an 
equilibrium second phase in both the binary Ni-Al 
and Ni-Cr-Al systems and a metastable phase in the 
Ni-Ti and Ni-Cr-Ti systems, with close matching 
of the y' and the fee matrix. The two phases have 
very similar lattice parameters (<0.25%, depending 
on composition) and the coherency (interfacial energy 
y 1 & 10-20 mJ m 2 ) confers a very low coarsening 
rate on the precipitate so that the alloy overages 
extremely slowly even at 0.7 T m . In alloys containing 
Nb, a metastable Ni 3 Nb phase occurs but although 
ordered and coherent, it is less stable than y' at high 
temperatures. 

Another source of strengthening is due to solid- 
solution hardening; Cr is a major element, Co may 
be added up to 20% and Mo, W and Ta up to a total 
of 15%. These elements also dissolve in y' so that the 
hardening effect may be twofold. Additions of Cr, like 
Co, also increase the y' solvus and lower the stacking- 
fault energy. 

In high-temperature service, the properties of the 
grain boundaries are as important as the strengthening 
by y' within the grains. Grain boundary strengthen- 
ing is produced mainly by precipitation of chromium 
and refractory metal carbides; small additions of Zr 
and B improve the morphology and stability of these 


carbides. Optimum properties are developed by multi- 
stage heat- treatment; the intermediate stages produce 
the desired grain boundary micro structure of carbide 
particles enveloped in a film of y' and the other stages 
produce two size ranges of / for the best combination 
of strength at both intermediate and high temperatures. 
Table 9.3 indicates the effect of the different alloying 
elements. 

Some of the nickel-based alloys have a tendency to 
form an embrittling a-phase (based on the composition 
FeCr) after long-term in-service applications, when 
composition changes occur removing a-resisting ele- 
ments such as Ni and enhancing a-promoting elements 
such as Cr, Mo or W. This tendency is predicted in 
alloy design by a technique known as Phacomp (phase 
computation) based on Pauling’s model of hybridiza- 
tion of 3d-electrons in transition metals. While a frac- 
tion of the 3d orbitals hybridize with p and s orbitals 
to create the metallic bond, the remainder forms non- 
bonding orbitals which partly fill the electron holes in 
the d-shell, increasing through the transition series to 
give electron hole numbers N v for Cr(4.66), Mn(3.66), 
Fe(3.66), Co(1.71) and Ni(0.66). Computation shows 
that the y/a phase relation depends on the average 
hole number N w given by 

n 

v v = 

i=i 

where m* is the atomic fraction of the ith element 
of electron hole number N Y and n is the number of 
elements in the alloy. The limit of y-phase stability is 
reached at N Y ~2.5. 

9.4.2 Nickel-based superalloy development 

A major application of superalloys is in turbine materi- 
als, jet engines, both disc and blades. Initial disc alloys 
were Inco 718 and Inco 901 produced by conventional 
casting ingot, forged billet and forged disc route. These 
alloys were developed from austenitic steels, which are 
still used in industrial turbines, but were later replaced 
by Waspaloy and Astroloy as stress and tempera- 
ture requirements increased. These alloys were turbine 
blade alloys with a suitably modified heat-treatment for 
discs. However, blade material is designed for creep, 
whereas disc material requires tensile strength cou- 
pled with low cycle fatigue life to cope with the stress 
changes in the flight cycle. To meet these requirements 
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Waspaloy was thermomechanically processed (TMP) 
to give a fine-grain size and a 40% increase in tensile 
strength over the corresponding blade material, but at 
the expense of creep life. Similar improvements for 
discs have been produced in Inco 901 by TMP. More 
highly- alloyed nickel-based discs suffer from exces- 
sive ingot segregation which makes grain size control 
difficult. Further development led to alloys produced 
by powder processing by gas atomization of a molten 
stream of metal in an inert argon atmosphere and con- 
solidating the resultant powder by HIPing to near-net 
shape. Such products are limited in stress application 
because of inclusions in the powder and, hence, to 
realize the maximum advantage of this process it is 
necessary to produce ‘ superclean ’ material by electron 
beam or plasma melting. 

Improvements in turbine materials were initially 
developed by increasing the volume fraction of y' 
in changing Nimonic 80 A up to Nimonic 115. Unfor- 
tunately, increasing the (Ti + Al) content lowers the 
melting point, thereby narrowing the forging range 
which makes processing more difficult. Improved 
high-temperature oxidation and hot corrosion perfor- 
mance has led to the introduction of aluminide and 
overlay coatings and subsequently the development of 
IN 738 and IN 939 with much improved hot-corrosion 
resistance. 

Further improvements in superalloys have depended 
on alternative manufacturing routes, particularly using 
modern casting technology. Vacuum casting was first 
used to retain high Ti and Al contents without oxi- 
dation loss. With 9-11% (Ti + Al), a 70% volume 
fraction of y' has been produced in IN 100 ( Nimocast 
PK 24) which does not require supplementary solid 
solution strengtheners and therefore gives a saving in 
density. 2 

Additions of high melting point elements such as W 
extend the high-temperature capabilities at the expense 
of density. M200 contains 12% W and 1% Nb but has 
limited ductility around 760° C which can be improved 
by additions of hafnium. The significant improvement 
in ductility and reduced porosity produced by Hf 
has led to its addition to other alloys (e.g. Mar 001 
(. IN 100 + Hf) and Mar 004 (IN 713 + Hf) and M002 
which contains 10% W, 2.5% Ta and 1.5% Hf). 

Creep failures are known to initiate at transverse 
grain boundaries and, hence, it is reasonable to aim 
to eliminate them in the turbine blade to gain further 
improvement in performance. Technologically this was 
achieved by directional solidified (DS) castings with 
columnar grains aligned along the growth direction 
with no grain boundaries normal to that direction. By 


incorporating a geometric constriction into the mould 
or by the use of a seed crystal it has been possible to 
eliminate grain boundaries entirely and grow the blade 
as one single crystal (see Chapter 3). 

The elimination of grain boundaries immediately 
removes the necessity for adding grain-boundary 
strengthening elements, such as C, B, Zr and Hf. to 
the superalloy. The removal of such elements raises 
the melting point and allows a higher solution heat- 
treatment temperature with consequent improvement in 
chemical homogeneity and more uniform distribution 
of y' precipitates. Particularly important, however, 
is the control of the growth direction along the 
[100] direction. The [100] alignment along the axis 
of the blade gives rise to an intrinsic high creep 
which enables thermal stresses caused by temperature 
gradients across the blade to be minimized. 

Single-crystal blades have now been used success- 
fully for both civil and military engines, SSR99 replac- 
ing Mar M002 but with improved tensile, creep and 
fatigue properties and a lower-density alloy RR2000 
replacing IN 100. 

9.4.3 Dispersion-hardened superalloys 

All / -hardened alloys experience a reduction in 
strength at elevated temperatures because of the 
solution of y' precipitate. To produce improved high- 
temperature strength, alloys hardened by oxides (ODS 
or oxide dispersion- strengthened), particularly thoria, 
have been developed. TD nickel (i.e. thoria dispersion- 
strengthened nickel or nickel with 2% Th0 2 ) and TD- 
Nichrome (i.e. nickel-20% Cr-2% Th0 2 ) is produced 
by mixing thoria sols with nickel-containing solutions 
to ensure a good dispersion. Drying the oxide powder 
mixture, followed by a hydrogen reduction process, 
produces a fine composite nickel -thoria powder. 
Compaction and controlled TMP during extrusion 
and rolling develops the structure and strength. 
These materials have excellent high-temperature creep 
resistance, but poor low-temperature properties which 
precludes aero-engine applications, although some 
other applications have been found. It is not possible 
to add the /-forming elements via the above process 
to produce lower-temperature strengthening because 
aluminium and titanium oxide are not reduced by 
hydrogen. 

Mechanical alloying (MA) is a dry powder process 
and overcomes this problem. MA, a high-energy ball- 
milling process, produces a homogeneous mixture of 
the matrix material and dispersoid. The final product 


1 Composition 


(wt %) 

Ni 

Cr 

Co 

Ti 

Al 

Mo 

C 

Zr 

B 

Others 

Waspaloy 

Balance 

19.5 

13.5 

3.0 

1.3 

— 

0.08 

0.06 

0.006 


Astroloy 

Balance 

15.0 

17.0 

3.5 

4.0 

5.25 

0.06 

— 

0.030 


Inco 718 

Balance 

19.0 

3.0 

0.9 

0.6 

3.0 

0.04 

— 

— 

Fe 20 Nb 5.2 

Inco 901 

42 

13.0 

— 

3.0 

0.3 

5.7 

0.04 

— 

— 

Bal. Fe 


2 A Pratt and Whitney version of IN 100 (B1900) replaced Ti with Ta to improve the castability. 
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Figure 9.10 Comparison of MA 6000 with other 
high-strength nickel alloys: (A) TD-Nickel, (B) MA 6000, 

(C) directionally-solidified Mar-M200 and (D) single-crystal 
PWA 454. 

is then produced by TMP with the grain structure on 
recrystallization elongated in the direction of working. 

These materials are generally anisotropic in their 
properties but a range of MA materials is emerging 
with more acceptable dispersoids. MA754 is a 
Nichrome matrix with Y 2 O 3 dispersoids and MA6000 
is a more complex nickel superalloy-based-yttria 
material (Ni-12Cr-2Mo-4W-2Ta-4.5Al-2.5Ti and 
I.IY 2 O 3 ) suitable for turbine blades, having the 
creep characteristics of the y' strengthened materials 
at low temperatures combined with the advantages 
of dispersion-strengthening at high temperature 
(see Figure 9.10). TMP limits the amount of y' 
strengthening that can be introduced and while 
MA6000 can be run at higher temperatures than 
conventional nickel-based superalloys, application is 
then limited to areas where the loadings can be 
kept low. 

9.5 Titanium alloys 

9.5.1 Basic alloying and heat-treatment 
features 

Since the emergence of titanium as a ‘wonder metal’ 
in the 1950s the titanium industry has developed 


a wide range of alloys with different compositions 
(see Figure 9.11 ). 1 These alloys rely on the high 
strength/weight ratio, good resistance to corrosion, 
combined low thermal conductivity and thermal expan- 
sion of titanium, properties which make it attractive 
for aerospace applications in both engine and airframe 
components. 

Titanium exists in the cph a form up to 882°C and 
then as bcc ft to its melting point. Alloying additions 
change the temperature at which the a to yd tran- 
sition takes place, solutes that raise the transus are 
termed o'- stabilizers and those that lower the yd-transus 
temperature are termed yd- stabilizers (Figure 9.12). The 
predominant a-stabilizer is aluminium. It is also an 
effective ^-strengthening element at ambient and ele- 
vated temperatures up to 550°C and thus a major con- 
stituent of most commercial alloys. The low density 
of aluminium is an important additional advantage, 
a-phase strengthening is also achieved by additions 
of tin and zirconium. These metals exhibit extensive 
solubility in both a and ft titanium but have little influ- 
ence on the yd-transus and are thus regarded as neutral 
additions, yd- stabilizers may be either yd-isomorphous 
(i.e. have the bcc structure like yd -Ti) or yd-eutectoid 
elements, yd-isomorphous elements have a limited a- 
solubility and are completely soluble in yd-titanium, 
typical additions being molybdenum, vanadium and 
niobium. In contrast, yd-eutectoid elements have a 
restricted solubility in yd-titanium and form intermetal- 
lic compounds by eutectoid decomposition of the yd- 
phase. In some alloy systems containing yd-eutectoid 
elements, such as silicon or copper, the compound for- 
mation (i.e., respectively, Ti 5 Si 3 and Ti 2 Cu) leads to an 
improvement in mechanical properties. Titanium will 
also take interstitial solutes in solid solution, hydro- 
gen being a yd-stabilizer while carbon, nitrogen and 
oxygen are strong o'-stabilizers. To minimize gas in Ti 
leads to a high cost of manufacture and heat-treatment 
requires vacuum or inert gas conditions and freedom 
from refractory contact. 

In describing titanium alloys it is conventional to 
classify them in terms of the microstructural phase ( a - 
alloys, yd-alloys, ( a + yd)-alloys or near-a- alloys, i.e. 
predominantly a-phase but with a small volume of 
yd-phase). Commercial alloys are usually heat-treated 
to optimize the mechanical properties by controlling 
the transformation of the yd- to a-phase, the extent of 
which is governed by the alloy composition and the 
cooling rate. The a-alloys can transform completely 
from the yd- to a-phase no matter what the cooling 
rate. Such treatments have a negligible effect on prop- 
erties and a-alloys tend to be used in the annealed 
state. 

! The Larson-Miller parameter 0 is given by 0 = T(A + 
logj 0 1), where T is the temperature in degrees Kelvin, t the 
time in hours and A a constant, and defines the conditions 
to produce a given amount of plastic strain (e.g. 0.2%). 
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Figure 9.11 Plot of stress versus Larson-Miller parameter <f> for a range of titanium alloys. 
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Figure 9.12 Representative phase diagrams for Ti-alloys. (a) Ti-V. (b) Ti-Al and (c) Ti-Cu ( after Smithells, 1992). 




310 Modern Physical Metallurgy and Materials Engineering 


Rapidly-cooled alloys containing /3- stabilizers form 
martensitic a from the /3-phase, whereas slower cool- 
ing rates favour a formation by a nucleation and 
growth process. Several morphologies of a can be pro- 
duced by controlling the nucleation and growth mech- 
anism; slow cooling, for example, tends to produce 
similarly aligned a platelets in colonies, combined with 
primary a at the grain boundaries. Faster cooling and 
higher o'-stabilizer contents result in a basket-weave 
micro structure. Metastable-/3, when aged, precipitates 
fine o', giving increased strength. The less stable the /3, 
the more a can be precipitated and hence the higher 
the strength attained. 

^ -stabilizing elements improve strength by strength- 
ening the /3-phase. The microstructure consists of pri- 
mary a combined with the yd-phase, which can be 
strengthened by an ageing treatment to precipitate 
acicular-o'. Further strengthening is achieved by the 
limited solubility of the /3 -stabilizing element. Gener- 
ally, these alloys have poor ductility properties. 

The most important alloys contain both a- and /3- 
stabilizers which, after working and annealing, give 
good strength and fabrication properties. For good 
creep strength an o'-titanium base, strengthened as 
much as possible by solute elements, is required. To 
meet this requirement the near-of alloys have been 
developed. These alloys combine the high a stability 
with sufficient yd- stabilizer to give adequate strength. 
By yd heat-treating, the ( a + yd) microstructure changes 
to a totally transformed yd structure containing basket- 
weave a. These alloys have good creep resistance and 
reasonable room- temperature properties. The basket- 
weave morphology is effective in inhibiting crack 
growth, and near-a alloys exhibit lower crack prop- 
agation rates than the o' + yd microstructures. Most of 
the commercial alloys which have been recently devel- 
oped are of this type. The major factor influencing 
the post-forging microstructure is the cooling rate; an 
oil quench results in a basket-weave structure (see 
Figure 9.13a) and an air cool gives a typically aligned 


micro structure (see Figure 9.13b). The alloy is usually 
stress-relieved by annealing for 2 hours at 625 °C or 
above. 

9.5.2 Commercial titanium alloys 

a alloys transform entirely to a on cooling from 
the /3-phase, regardless of cooling rate. Commercially 
pure titanium with a nominal oxygen content of 500, 
1000, 1550 and 2700 ppm, respectively, gives tensile 
properties which range from 450 to 640 MN nrT 2 
(see Figure 9.14). These are used mainly in sheet 
form. Solid- solution strengthening by aluminium, tin 
or zirconium increases tensile and creep strength and 
IMI 317 (Ti-5Al-2.5Sn) is typical. 

Increasing the cc-stabilizing composition increases 
creep strength but makes fabrication more difficult and 
can lead to embrittlement during prolonged exposure 
at temperature in service due to the formation of the 
coherent ordered phase a 2 (Ti 3 Al). To avoid this it was 
empirically established that the aluminium equivalent 
Al* must be no greater than 9 where Al* (in wt %) is 
given by: 

Al* = A1+ ^ ^ + (10 x 0 2 ) 

3 6 

IMI 31 7 is difficult to fabricate and is often replaced 
by an IMI 230, which is an a-phase alloy containing 
the precipitation-hardening phase Ti 2 Cu; it can be 
fabricated and welded and has good strength and 
ductility up to 400° C. 

/3 alloys contain enough /3- stabilizing elements 
to maintain the bcc /3-phase to room temperature. 
Unfortunately, bcc /3-Ti alloys are prone to 
embrittlement. The binary alloys, titanium with 
Fe, Cr, Mn, Nb, Mo, Cr or V, all precipitate 
the embrittling co-phase. More complex /3-Ti 
alloys containing Cr also suffer embrittlement from 
TiCr 2 . More stable alloys have been developed 
(e.g. (Ti-ll.5Mo-6Zr-4.5Sn)) but are little used. 




Figure 9.13 Microstructure ofnear-a titanium alloy ( IMI 829 ) initially ft- heat-treated at 1050° C for 1 h. (a) Oil quenched and 
(b) air cooled ( after Woodfield, Loretto and Smallman, 1988). 
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Figure 9.14 Variation of tensile strength with temperature for a range of commercial titanium alloys. 


However, /3-phase alloys, such as Ti-10V-2Fe-3Al, 
have potential as airframe construction materials 
offering high strength (1250 MN m -2 ) and toughness 
(45 MN nr 3 / 2 ) in relatively thick cross-sections 
(90 mm) and superior hot-working characteristics 
which are attractive in expensive forging operations. 

( a + P) alloys are probably the most widely used 
titanium alloys and contain alloying additions which 
strengthen both phases. These alloys are thermo- 
mechanical processed to control the size, shape and 
distribution of both a and /3. The most versatile 
( a + /3) alloy is IMI 318 , which contains 6% A1 and 
4% V; it can be used at temperatures up to 350°C and 
has good forging and fabrication properties. It initially 
replaced steel as a disc material in jet engines leading 
to 20% weight saving. Another important ( a + P) alloy 
is IMI 550 (Ti-6Al-2Sn-4Mo-0.5Si) which has 
higher strength and good creep resistance up to 400° C. 
(i a + P) alloys remain the principal materials for fan 
discs and blades and for low-and intermediate-pressure 
compressor discs and blades of current gas turbine 
engines, (a + P) alloys with extra low interstitial (ELI) 
content are attractive as ‘damage-tolerant’ materials 
for critical airframe components. Ti-6A1-4V with low 
oxygen has a tensile strength 8% lower than the stan- 
dard alloy but, more importantly, the minimum fracture 
toughness is 60 MN m _3/2 . 

Near-ot-alloys have increased the strength and the 
volume of the more creep-resistant a-phase at the 


expense of the bcc /3-phase, which imports good low- 
temperature strength and forgeability. IMI 685 was the 
first titanium alloy to operate above 500°C. It contains 
Ti-6Al-5Zr-0.5Mo-0.2Si with A1 and Zr instead of 
Sn as o'-stabilizers, reduced Mo, the /3- stabilizer, to 
minimize ft at the a-needles and Si to improve creep 
resistance. These alloys are worked and heat-treated 
in the /3-range and have a tensile strength of about 
1000 MN m -2 and give less than 0.1% creep strain in 
100 h under a stress of 310 MN m -2 at 520°C. IMI 
829 (Ti-5.6Al-3.5Sn-3Zr-lNb-0.25Mo-0.3Si) has 
been derived from IMI 685 by replacing some of the 
Zr with the more potent strengthener Sn. It is ^-heat- 
treated and has sufficient higher-temperature capabil- 
ity to be used in the hotter regions of engines. IMI 
834 (Ti-5.8Al-4Sn-3.5Zr-0.7Nb-0.5Mo-0.35Si- 
0.06C) has been developed for use up to 600° C and 
combines the high fracture toughness and crack propa- 
gation resistance of a transformed /3-structure with the 
typical equiaxed structure of the a + /3 alloys, providing 
good fatigue resistance and ductility. The small addition 
of carbon allows a controlled high a/p heat- treatment. 
Hot-working is carried out in the a + ft field and heat- 
treatment involves solution treatment for 2 hours at 
1025°C, consistent with about 15% primary a , followed 
by oil quenching prior to ageing for 2 hours at 700° C, 
then air cooling. With such good high- temperature prop- 
erties the alloy is being specified for engine compressor 
applications. 
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9.5.3 Processing of titanium alloys 

Some of the titanium alloys have excellent superplastic 
forming characteristics and fabricating manufacturers 
have taken advantage of these properties in developing 
new processing technologies. Fine-grained thin sheets 
of IMI 318 have been superplastically deformed at 
900° C under slow strain-rate conditions to produce a 
variety of complex parts. In conjunction with diffusion 
bonding, weight-saving of about one-third has been 
achieved in, for example, the wing access panels of 
the A320 Airbus. Near-a-alloys have also been shown 
to exhibit superplastic behaviour. 

Casting technology has also been developed. Invest- 
ment casting of IMI 318 is widely used but there is 
now increasing interest in the use of high-temperature 
alloy castings as weight-saving alternatives to steel and 
nickel alloys above 500°C. The introduction of HIPing 
for titanium castings has widened their potential appli- 
cation. For IMI 829, the best properties are obtained 
in the a /ft HIP and solution-treated and aged condi- 
tion, generally meeting the specification for the alloy 
in the wrought condition. Rapid solidification process- 
ing may also offer possibilities by incorporating Er 2 0 3 
and other rare earth oxide dispersions into the titanium 
matrix. At present, the control of these dispersoids 
remains a problem. 

To develop higher-performance materials than the 
near-of alloys into the 700-800°C range, attention 
is being given to strengthening high-temperature 
alloy materials, including titanium aluminides (see 
Section 9.6) with ceramic fibres such as SiC and B 4 C. 

9.6 Structural intermetallic 
compounds 

9.6.1 General properties of intermetallic 
compounds 

In terms of their properties, intermetallic compounds 
are generally regarded as a class of materials between 
metals and ceramics which arises from the bonding 
being a mixture of metallic and covalent. Intermetallic s 
are intrinsically strong (and in the Ll 2 -ordered fee 
compounds increases with temperature up to about 
600° C) with high elastic modulus. The strong bond- 
ing and ordered structure also gives rise to lower 


self-diffusion coefficients and hence greater stability 
of diffusion-controlled properties. Some of the com- 
pounds of current interest are shown in Table 9.4. 
Intermetallic s containing aluminium or silicon exhibit 
a resistance to oxidation and corrosion because of their 
adherent surface oxides. Those based on light elements 
have attractive low density giving rise to high spe- 
cific properties particularly important in weight-saving 
applications. 

Like ceramics, however, the greatest disadvantage 
of intermetallic s is their low ductility, particularly at 
low and intermediate temperatures. The reasons for 
the lack of ductility vary from compound to com- 
pound but include (1) a limited number of easy defor- 
mation modes to satisfy the von Mises criterion, 

(2) operation of dislocations with large slip vectors, 

(3) restricted cross-slip, (4) difficulty of transmitting 
slip across grain boundaries, (5) intrinsic grain bound- 
ary weakness, (6) segregation of deleterious solutes 
to grain boundaries, (7) covalent bonding and high 
Peierls-Nabarro stress and (8) environmental suscep- 
tibility. It has been demonstrated, however, that some 
intermetallic s can be ductilized by small alloying addi- 
tions: Ni 3 Al with boron, TiAl with Mn, Ti 3 Al with Nb. 
This observation has encouraged recent research and 
development of intermetallic s and the possibility of 
application of those materials. 

9.6.2 Nickel aluminides 

A/7 3 A/ (i nickel aluminide ) is the ordered fee /' phase 
and is a major strengthening component in superalloys. 
Ni 3 Al single crystals are reasonably ductile but in 
poly crystalline form are quite brittle and fail by inter- 
granular fracture at ambient temperatures. The basic 
slip system is {1 1 1 } (1 10) and has more than five 
independent slip modes but still exhibits grain bound- 
ary brittleness. Remarkably, small additions of ~0.1 
at.% boron produce elongations up to 50%. Gen- 
eral explanations for this effect are that B segregates 
to grain boundaries and (1) increases the cohesive 
strength of the boundary and (2) disorders the grain 
boundary region so that dislocation pile-up stresses 
can be relieved by slip across the boundary rather 
than by cracking. This general explanation is no 
doubt of significance but additionally, there are dis- 
tinct micro structural changes within the grains which 


Table 9.4 Comparison of physical properties of some intermetallic compounds 


Compound 

Crystal structure 

Melting 
temp. (°C) 

Density 
(kg m~ 3 ) 

Young's 

modulus/density 

Ni 3 Al 

Ll 2 (ordered fee) 

1400 

7500 

45 

Ni 3 Si 

Ll 2 (ordered fee) 

1140 

7300 


NiAl 

B 2 (ordered bee 

1640 

5860 

35 

Ti 3 Si 

DO 19 (ordered eph) 

1600 

4200 

50 

TiAl 

Llo (ordered tetragonal) 

1460 

3910 

24 

FeAl 

B 2 (ordered bee) 

1300 

5560 

47 
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must lead to a reduced friction stress and ease the 
operation of poly slip. For example, the addition of B 
reduces the occurrence of stacking-fault defects. Addi- 
tion of solutes, such as B, are not expected to raise 
the stacking-fault energy and hence this effect possi- 
bly arises from the segregation of B to dislocations, 
preventing the superdislocation dissociation reactions 
(see Chapter 4, Section 4.6.5). 

Microhardness measurements inside grains and 
away from grain boundaries indeed show that boron 
softens the grains. The ductilization effect is limited 
to nickel-rich aluminides and cannot be produced by 
carbon or other elements, although some substitutional 
solutes such as Pd, which substitutes for Ni, and Cu 
produce a small improvement in elongation. Small 
additions of Fe, Mn and Hf have also been claimed 
to improve fabric ability. Grain size has been shown to 
influence the yield stress according to the Hall-Petch 
equation and B appears to lower the slope k y and 
facilitate slip across grain boundaries. These alloys are 
also known to be environmentally sensitive. Hf, for 
example, which does not segregate to grain boundaries 
but still improves ductility, has a large misfit (11%) 
and possibly traps H from environmental reactions, 
such as A1 + H 2 0 -> A1 2 0 3 + H. Ti, which has a 
small misfit, does not improve the ductility. 

The most striking property of Ni 3 Al is the increas- 
ing yield stress with increasing temperature up to 
the peak temperature of 600°C (see Figure 9.15). 
This behaviour is also observed in other Ll 2 inter- 
metallics, particularly Ni 3 Si and Zr 3 Al. This effect 
results from the thermaily- activated cross-slip of screw 
dislocations from the {111} planes to the {100} cube 



Figure 9.15 Effect of aluminium content on the 
temperature-dependence of the flow stress in NisAl (after 
Noguchi, Oya and Suzuki, 1981). 


Table 9.5 Anti-phase boundary energies in Ni 3 Al 


Alloy 

yin ( mJ/m 2 ) 

yioo (mJ/m 2 ) 

Yin / Yioo 

Ni-23.5A1 

183 ± 12 

157 ± 8 

1.17 

Ni-24.5A1 

179 ± 15 

143 ± 7 

1.25 

Ni-25.5A1 

175 ± 13 

134 ± 8 

1.31 

Ni-26.5A1 

175 ± 12 

113 ±10 

1.51 

Ni-23.5A1 + 0.25B 

170 ± 13 

124 ± 8 

1.37 


planes where the apb-energy is somewhat lower. The 
glide of super-dislocations is made more difficult by 
the formation of Kear-Wilsdorf (K-W) locks (see 
Chapter 7) and their frequency increases with tem- 
perature. Electron-microscopy measurements of apb 
energies given in Table 9.5 shows that the apb energy 
on {100} decreases with aluminium content and this 
influences the composition-dependence of the strength, 
shown in Table 9.15. The cross-slip of screw disloca- 
tions from the {111} planes to cube planes also gives 
rise to a high work-hardening rate. 

Although the study of creep in /-based materials is 
limited, it does appear to be inferior to that of super- 
alloys. Above 0.6T m creep displays the characteristic 
primary and secondary stages, with steady-state creep 
having a stress exponent of approximately 4 and an 
activation energy of around 400 KJ mol -1 , consistent 
with climb being the rate-controlling process. At inter- 
mediate temperatures (i.e. around the 600° C peak in 
the yield stress curve) the creep behaviour does not 
display the three typical stages. Instead, after primary 
creep, the rate continuously increases with creep strain, 
a feature known as inverse creep. In primary creep, 
planar dissociation leads to an initial high creep rate 
which slows as the screws dissociate on {100} planes to 
form K-W locks. However, it is the mobile edge dis- 
locations which contribute most to the primary creep 
strain and their immobilization by climb dissociation 
which brings about the exhaustion of primary creep. 
The inverse creep regime is still not fully researched 
but could well be caused by glide on the {100} planes 
of the cross-slipped screw components. 

The fatigue life in high-cycle fatigue is related to 
the influence of temperature on the yield stress and 
is invariant with temperature up to about 800° C, but 
falls off for higher temperatures with cracks propagat- 
ing along slip planes. With boron doping the fatigue 
resistance is very sensitive to aluminium content and 
decreases substantially as A1 increases from 24 to 26 
at.%. Nevertheless, crack growth rates of Ni 3 Al + B 
are lower than for commercial alloys. 

Hyperstoichiometric Ni 3 Al with boron can be pre- 
pared by either vacuum melting and casting or from 
powders by HIPing. Fabrication into sheets is possible 
with intermediate anneals at 1000°C. At present, how- 
ever, the application of Ni 3 Al is not significant; Ni 3 Al 
powders are used as bond coats to improve adher- 
ence of thermal spray coatings. Nevertheless, Ni 3 Al 
alloys have been tested as heating elements, diesel 
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Table 9.6 Comparison of super a 2 and y alloys with conventional titanium alloys 


Property 

Titanium alloys 

(«2 + P) 

(y + oc 2 ) 

Density (g cm -3 ) 

4.54 

4.84 

4.04 

E, stiffness (GN m -2 ) 

110 

145 

176 

RT tensile strength (MN m -2 ) 


1100 

620 

HT (760° C) tensile strength (MN m -2 ) 


620 

550 

Max. creep temp. (°C) 

540 

730 

900 

RT ductility (%) 

20 

4-6 

3 

Service temp, ductility (%) 

high 

5-12 

5-12 


engine components, glass-making moulds and hot- 
forging dies, slurry-feed pumps in coal-fired boilers, 
hot-cutting wires and rubber extruders in the chemical 
industry. Ni 3 Al-based alloys as matrix materials for 
composites are also being investigated. 

Nickel aluminide (NiAl) has a caesium chloride or 
ordered yd -brass structure and exists over a very wide 
range of composition either side of the stoichiomet- 
ric 50/50 composition. It has a high melting point of 
1600°C and exhibits a good resistance to oxidation. 
Even with such favourable properties it has not been 
commercially exploited because of its unfavourable 
mechanical properties. Because it is strongly ordered, 
low-temperature deformation occurs by an a (100) dis- 
location vector and not by a/2(\ 1 1) super-dislocations. 
{110} (100) slip therefore leads to insufficient slip 
modes to satisfy the general plasticity criterion and 
in the poly crystalline condition /3-NiAl is extremely 
brittle. The ductility does improve with increasing 
temperature but above 500° C the strength drops off 
considerably as a result of extensive glide and climb. 
Improvements in properties are potentially possible by 
refinement of the grain size and by using alloying addi- 
tions to promote (111) slip, as in FeAl, which has 
the same structure. In this respect, additions of Fe, Cr 
or Mn appear to be of interest. For high-temperature 
applications, ternary additions of Nb and Ta have been 
shown to improve creep strength through the precipi- 
tation of second phases and mechanical alloying with 
yttria or alumina is also beneficial. 

A further commercial problem of this material is 
that conventional production by casting and fabrication 
is difficult, but production through a powder route 
followed by either HIPing or hot extrusion is more 
promising. 

9.6.3 Titanium aluminides 

Because of the limited scope for improvements in 
the properties of conventional titanium alloys above 
650°C, either by alloy development or by TMP, 
increased attention is being given to the titanium inter- 
metallic s, Ti 3 Al ((^ 2 -phase) and TiAl (y-phase). With 
low density, high modulus and good creep and oxi- 
dation resistance up to 900° C they have considerable 
potential if the poor ductility at ambient temperatures 


could be improved. A comparison of Ti 3 Al- and TiAl- 
based materials with conventional Ti-alloys is given in 
Table 9.6. 

Electron microscopy studies of Ti^Al or a 2 have 
shown that deformation by slip occurs at room tem- 
perature by coupled pairs of dislocations with b = 
1/6(1 120) which glide only on {1 0 1 0} planes and by 
very limited glide on {1 121} with pairs of dislocation 
with b = 1/6(112 6). The ductility increases at higher 
temperatures due to climb of the (1 12 0) dislocations 
and to the increased glide mobility of 1/6(1120) 
and 1/6(112 6) dislocations through thermal activa- 
tion. Only limited activity of the {0001} (1120) slip 
systems is observed, even at high temperatures. 

The most successful improvements in the ductil- 
ity of Ti 3 Al have been produced by the addition of 
^ -stabilizing elements, particularly niobium, to pro- 
duce o' 2 -alloys. An addition of 4 at.% Nb produces 
significant slip on {1 0 1 0} (1 1 20), {0001} (1 120) and 
{1 1 2 1}(1 1 2 6) as well as some slip on {1011} (1120). 
This improvement is attributed to the decrease in 
covalency as Nb substitutes for Ti with a conse- 
quent reduction in the Peierls-Nabarro friction stress. 
Alloys based on a 2 are Ti-(23-25) Al-(8-18)Nb, 
of which Ti-24Al-llNb has excellent spalling resis- 
tance. Most Ti 3 Al + Nb alloys, such as super a 2 , 
also contain other yd- stabilizers including Mo and V, 
i.e. Ti-25Al-10Nb-3V-lMo, which exhibits about 
7% room temperature elongation. Alloying Ti 3 Al with 
yd -stabilizing elements to produce two-phase alloys 
significantly increases the fracture strength. These 
Ti 3 Al-based alloys can be plasma-melted and cast 
followed by TMP in the (a 2 + yd) or yd-range. The 
improved ductility of Ti 3 A1 alloys has led to aerospace 
applications in after-burners in jet engines where it 
compares favourably in performance with superalloys 
and gives a 40% weight- saving. 

Developments are taking place in rapid solidification 
processing to include a second phase (e.g. rare-earth 
precipitates) and to provide powders, which may be 
consolidated by HIPing, to produce fully dense compo- 
nents with properties comparable to wrought products. 
There are also developments in intermetallic matrix 
composites by the addition of SiC or A1 2 0 3 fibres 
(~10 pm). These have some attractive properties, but 
the fibre/intermetallic interface is still a problem. 
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Figure 9.16 Structure of (a) TiAl ( Llo ) and (b) ( 111 ) plane showing slip vectors for possible dissociation reactions, e.g. 
ordinary dislocations 1/2[110], super dislocations [Oil ] and 1 12 [112], and twin dislocations 1 /6[1 12 ] (after Kim and Froes, 
1990). 


The y -phase Ti-(50-56)Al has an ordered fc tetrag- 
onal (Ll 0 ) structure up to the m.p. 1460°C. with c/a — 
1.02 (Figure 9.16). Deformation by slip occurs on 
{111} planes and, because of the tetragonality, there are 
two types of dislocations, namely ordinary dislocations 
1/2(110) and superdislocations (Oil) = 1/2(011) + 
1/2(011). Another superdislocation 1/2(112) has also 
been reported. 

At room temperature, deformation occurs by both 
ordinary and super dislocations. However, [Oil] and 
[101] super dislocations are largely immobile because 
segments of the trailing superpartials 1/6(1 12) -type 
form faulted dipoles. The dissociated 1/2(110) dislo- 
cations bounding complex stacking faults are largely 
sessile because of the Peierls-Nabarro stress. Some 
limited twinning also occurs. The flow stress increases 
with increasing temperature up to 600° C as the super- 
partials become mobile and cross-slip from {111} to 
{100} to form K-W-type locks, the 1/2(110) slip 
activity increases and twinning is promoted. 

The two-phase (y + a 2 ) Ti-Al alloys have better 
ductility than single-phase y with a maximum at 48 
at.% Al. This improvement has been attributed to the 
reduced c/a with decreased Al, further promotion of 
twinning and the scavenging of 0 2 and N 2 interstitials 
by of 2 . Additions of V, Cr or Mn, which reduce the unit 
cell, also improve ductility. However, the two-phase 
micro structure is extremely important in optimizing 
the tensile and creep properties and the fracture tough- 
ness. Both the volume fraction and morphology of 
the lamellar c^-structure is important and about 30% 
lamellar a 2 appears to be the optimum. 

9.6.4 Other intermetallic compounds 

A number of intermetallic compounds are already used 
in areas which do not rely on stringent mechanical 
properties. Fe 3 Al, for example, is used in fossil-fuel 
plants where resistance to both sulphur attack and 


oxidation is important. Ni 3 Si is used where resistance 
to hot sulphuric acid is required. There are several 
compounds with rare-earth elements used in magnet 
technology (see Chapter 6). Pdln is gold-coloured and 
a possible dental material. Zr 3 Al has a low neutron 
capture cross-section and is a possible reactor material. 

The /3-compound NiTi ( Nitinol ) is an important 
shape-memory alloy. The shape-memory effect (SME) 
manifests itself when the alloy is deformed into a shape 
while in a low-temperature martensitic condition but 
regains its original shape when the stress is removed 
and it is heated above the martensitic regime. Strains 
of the order of 8% can be completely recovered by 
the reverse transformation of the deformed marten- 
sitic phase to the higher-temperature parent phase. 
The martensite transformation in these alloys is a 
thermo-elastic martensitic transformation in which the 
martensite plates form and grow continuously as the 
temperature is lowered and are removed reversibly as 
the temperature is raised. NiTi was one of the orig- 
inal SME alloys, but there are many copper-based 
alloys which undergo a martensitic transformation, 
e.g. Cu-17Zr-7Al. Application of SME alloys relies 
on the characteristic that they can change shape repeat- 
edly as a result of heating and cooling and exert a force 
as the shape changes. By composition control (increas- 
ing the Ni content or substitution of Cu lowers the M s 
temperature of TiNi), the shape-memory can be trig- 
gered by normal body temperature or any other conve- 
nient temperature to operate a device. Several biomed- 
ical applications have been developed in orthopaedic 
devices (e.g. pulling fractures together), in orthodon- 
tics, in intrauterine contraceptives and in artificial 
hearts. Industrial applications include pipe couplings 
for ships which shrink during heating, electrical con- 
nectors, servo-mechanisms for driving recording pens, 
switches, actuators and thermostats. 
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Table 9.7 Aluminium alloy designation systems 


Wrought alloys 

Designation 

Casting alloys 

Designation 

99.00% (min.) aluminium 

1XXX 

99.00% (min.) aluminium 

1XX.X 

Copper 

2XXX 

Copper 

2XX.X 

Manganese 

3XXX 

Silicon with added copper and/or magnesium 

3XX.X 

Silicon 

4XXX 

Silicon 

4XX.X 

Magnesium 

5XXX 

Magnesium 

5XX.X 

Magnesium and silicon 

6XXX 

Zinc 

6XX.X 

Zinc 

7XXX 

Tin 

7XX.X 

Others 

8XXX 

Others 

8XX.X 


9.7 Aluminium alloys 

9.7.1 Designation of aluminium alloys 

Aluminium has an attractive combination of properties 
(i.e. low density, strong, easy to fabricate) which can 
be developed and modified by alloying and process- 
ing. The basic physical metallurgy has been outlined 
in Chapter 8 and hence in this section some of the 
alloys developed for particular industries such as the 
transportation, construction, electrical and packaging 
industries will be considered. 

Aluminium alloys are identified by a four-digit sys- 
tem 1 based on the main alloying element. This is 
summarized in Table 9.7. For wrought alloys the first 
digit identifies the alloy group and the second digit 
any modification to the original alloy which is iden- 
tified by the last two digits. The system is slightly 
different for casting alloys. The first digit again identi- 
fies the group, the second two digits identify the alloy 
and the last digit, preceded by a decimal point, indi- 
cates the product form (i.e. 0 for casting and 1 for 
ingot). 

9.7.2 Applications of aluminium alloys 

With the need for fuel economy and weight-saving, 
aluminium alloys are increasingly used in cars, and its 
two most important properties are density and thermal 
conductivity. Over the past 15 years the aluminium 
content of cars has increased from around 5% to 13% 
by both volume and weight. In engines they are used 
for pistons, cylinder heads and sumps. Al-Si cast- 
ing alloys of the 3XX.X series are being used for 
engine blocks and Al-Si pistons with cast iron cylin- 
der liners for wear resistance. The superior thermal 
conductivity reduces the volume of coolant in the 
system. Aluminium wheels, vacuum-cast or forged, 
are replacing conventional steel wheels in sports cars. 
Heat-treatable 2 2XXX and 6XXX as well as 5 XXX 

1 The International Alloy Designation System (IADS) was 
first introduced by the Aluminium Association of the USA 
and is now standard for wrought alloys. 

2 Alloy treatment is usually described by a suffix letter and 
digit system (e.g. F — as fabricated, O — annealed, 

H — work-hardening and T — heat-treated. Digits following 


series can be used for body sheet. However, the mod- 
ulus of aluminium is only one-third that of steel 
and hence significant design changes are necessary 
to maintain rigidity and stiffness. A straightforward 
increase in gauge thickness would lead to a doubling 
cost which limits the replacement to ‘quality’ cars. 
For bulk market cars, gauge for gauge substitution 
for steel is a future objective with structural reinforce- 
ment to enhance body torsional characteristics. This is 
possible with the use of adhesives in a weld-bonding 
approach which can reduce the weight by half and 
fuel consumption and C0 2 emission by almost 15% 
Aluminium- structured vehicle technology (ASVT) is 
likely to be essential when emission control to reduce 
global warming effects is tightened. 

The Honda NSX all-aluminium car 3 is manufactured 
with conventional design and assembly and has three 
different 6000 series alloys for external panels and a 
structural subframe of 5182 alloy. 

In aircraft construction use is made of the high 
strength-density ratio of the Al-Cu (2000 series) 
and Al-Zn-Mg (7000 series) alloys in extruded 
form for wing spars, fuselage and landing gear and 
for the skin in plate or sheet form; typically 7075 
(5.6Zn-2.5Mg-1.6Cu) is used in the T6 condition and 
2024 (4.4Cu-1.5Mg-0.6Mn) in the T3 or T8 con- 
ditions. The alloys of the 7000 series have higher 
strength than the 2000 alloys but lower resistance 
to fracture. However, higher-purity levels (e.g. 2124 
alloy), give enhanced toughness. Alloys of both series 
lose strength above 100°C and are thus not suitable 
for supersonic aircraft. The RR58 alloy used for the 
Concorde at temperatures up to 175°C was originally 
an early engine material, for compressor blades and 
impellers. 

Aluminium is used extensively in the construction 
industry because of its light weight, resistance to 
atmospheric attack and surface finish. For decorative 
applications, dyed anodic films produce a permanent 
durable finish. Generally, the Al-Mg-Si 6000 series 

H specify the work-hardened condition, and that following T 
the type of ageing treatment (e.g. T6 is solution heat-treated 
and artificially aged, T4 solution heat-treated and naturally 
aged, T3 solution heat-treated and cold-worked). 

3 Launched in the USA in 1992. 
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is used, i.e. 6063 medium- strength and 6082 higher- 
strength alloys in the T6 condition for extrusions or 
T4 where forming is required during fabrication. The 
Al-Cu (20 14 A) alloy is also used for heavily-loaded 
primary structures. 

The packaging industry also provides a large market 
for aluminium alloys. The main requirement is for low- 
cost, simple alloys capable of being formed and the 
Al-Mg (5000) series is often used. Impurity control is 
essential and liquid metal filtering is necessary in the 
production of thin sheet. 

In the electrical industry, electrical conductivity 
grade aluminium and higher- strength 5000 and 6000 
series alloys are used for transmission lines, replacing 
the more expensive copper. Dispersion-strengthened 
alloys containing a fine dislocation substructure sta- 
bilized by small precipitates are used for electrical 
wiring. 

9.7.3 Aluminium-lithium alloys 

The advantages of aluminium-lithium alloys have 
been known for a long time but lower density and 
increased elastic stiffness were offset by poor ductil- 
ity and fracture performance. Basic Al-Li alloys pre- 
cipitate the (AlsLi)^, a spherical ordered precipitate. 
Precipitation-hardening leads, however, to localized 
deformation with limited cross-slip and poor fracture 
behaviour. Additions of copper to the alloy so that the 
Li/Cu ratio is high leads to the formation of both 8' and 
a T i phase (Al 2 CuLi). This gives some improvement 
in fracture toughness by independent control of the two 
precipitates. In the quaternary system Al-Li-Cu-Mg 
the S -phase precipitates in addition to the 8' and 
T\. The S-phase is better at dispersing slip than 
and with adjustment of composition can be made to 
dominate the structure. Both S and T \ are nucleated 
heterogeneously on dislocations and the best results 
are obtained by cold-working the alloy after solution- 
treatment. 

Commercial alloys based on this background are 
Lital A, B and C which have been developed to 
match the (1) conventional medium- strength 2014- 
T6, (2) high-strength 7075-T6 and (3) damage-tolerant 
2024-T3 alloys, with a 10% reduction in density and 
10% improvement in stiffness (see Table 9.8). 

Lital A in T6 sheet form typically has 365 MN ur 2 
0.2% proof stress, 465 MN m -2 TS, 6% elongation, 
66 MN m _3/2 fracture toughness, an elastic modulus 
of 80 GN m -2 and density of 2550 kg m -3 . Lital B 
has roughly 10% improvement in strength. Lital C is a 


Table 9.8 Composition of commercial aluminium-lithium 
alloys. 


Alloy 

Li 

Cu 

Mg 

Fe 

Si 

Zr 

(8090) Lital A 

2.5 

1.3 

0.7 

< 0.2 

< 0.1 

0.12 

(9091) Lital B 

2.6 

1.9 

0.9 

< 0.2 

< 0.1 

0.12 


variant of the 8090 alloy and is heat-treated to increase 
toughness (~76 MN nrr 3/2 ) at the expense of strength 
(TS -440 MN m~ 2 ). 

Lithium additions are also being made to conven- 
tional aluminium alloys. The addition of lithium has 
a major influence since Li possesses a significant 
vacancy binding energy of about 0.25 eV. Lithium 
atoms therefore trap vacancies and form Li-V aggre- 
gates. This decreases the concentration of mobile 
vacancies available for the transport of zone-forming 
atoms, and therefore inhibits the diffusion of Zn and 
Mg in 7075, and Si and Mg in 6061, into zones. 
Second, the Li-V aggregates, very probably present 
during quenching and immediately after ageing, act as 
heterogeneous sites for subsequent clustering of zone- 
forming atoms during ageing. 

Additions of Li into either Al-2Mg-0.6Si-0.3Cu- 
0.3Cr (6061) or Al-5.9Zn-2.4Mg-l.7Cu (7075) 
modify the precipitation scheme and age-hardening 
behaviour of the original alloys. The precipitates 
which form in the base alloys are inhibited or even 
suppressed. For the 6061 the addition of 0.7% Li 
retards the precipitation of needle-shaped GP zones 
and produces a ternary compound AlLiSi, whereas 
the addition of 2.0% Li results in the dominant 
precipitation of 8' and extremely delayed and limited 
formation of needle-shaped GP zones and AlLiSi. For 
7075 the addition of 0.7% Li alters the conventional 
precipitation scheme from solute-rich GP zone -> 
rj' -> ?7MgZn 2 into vacancy-rich GP zone -> T' -> 
T(AlZn) 49 Mg 32 , whereas the addition of 2.0% Li 
produces the dominant 8' precipitate and limited and 
delayed formation of T-phase. As a result, the age- 
hardening response relating to these major hardening 
phases in both base alloys is delayed or decreased. 
Such additions can produce narrower PFZ’s and give 
improved fracture properties. 

A further commercial alloy is UL40 , which is essen- 
tially a binary alloy containing 4% Li. The alloy is 
cast using a spray-deposition process resulting in a 
fine-grained microstructure, with uniform distribution 
of second phase, free from oxide. The high Li con- 
tent alloy has a very low density (2400 kg m -3 ) and 
is almost a third lighter than conventional aluminium 
and magnesium alloys. It extrudes well and can be 
welded with Al-Mg-Zr filler, producing components 
for aircraft and helicopters, such as pump housings 
and valves, and for yachting with good corrosion 
resistance. 

9.7.4 Processing developments 

9.7.4. 1 Superplastic aluminium alloys 
Superplastic forming is a cost-effective manufactur- 
ing process for producing both simple and complex 
shapes from aluminium alloy sheet, because of its 
low-cost tooling and short lead times for produc- 
tion. A range of alloys is available including 2004 
(Al-6Cu-0.4Zr) or Supral, 5083 SPF, 7475 SPF and 
Lital 8090 SPF (Al-Li-Mg-Cu). Supral and Lital 
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dynamically recrystallize to a fine grain size during the 
early stages of deformation (~500°C) which is stabi- 
lized with ZrAl 3 particles. The grain size in 7475 is 
stabilized by TMP and submicron Cr particles and, in 
5083, by a Mn-dispersoid. Components are formed by 
clamping the alloy sheet in a pressure chamber and 
then applying gas or air pressure to force the sheet 
slowly into contact with a tool surface; both the tool 
and sheet are maintained at the forming temperature 
throughout the process. During normal superplastic 
forming, the alloys tend to develop voiding. This void 
formation is minimized by forming in hydrostatic con- 
ditions by introducing a gas pressure on both the front 
and back surfaces of the sheet being formed. The sheet 
is forced against the tool surface with a small pressure 
differential. 

9. 7. 4. 2 Rapid solidification processing of 
aluminium alloys 

Rapid solidification processing (RSP) has been applied 
to aluminium alloys to produce a fine grain size and 
extend solid solubility, particularly for the transition 
metals, iron, molybdenum, chromium and zirconium, 
which usually have low solid solubility and low diffu- 
sion rates in aluminium. Interestingly RSP alloys con- 
taining Fe and Cr, on annealing, precipitate metastable 
spherical quasi-crystals of icosahedral phase with five- 
fold symmetry. These are extremely stable and hardly 
coarsen after extensive heat-treatment which indicates 
a potential for alloy development. 

A series of commercially-available high-temperature 
Al-Fe-V-Si alloys has been developed and consist 



Temperature (°C) 



of very fine, spherical Al 13 (FeV) 3 Si silicides uni- 
formly dispersed throughout the matrix which display 
much slower coarsening rates than other dispersoids. 
A typical alloy with 27 vol.% silicides is 8009 
(Al-8.5Fe-l.3V-l.7Si) and, without any needle or 
platelet precipitates in the microstructure, has a K\ c 
~29 MN m -3/2 . The tensile properties as a function of 
temperature are shown in Figure 9.17 in comparison 
with a conventional 7075 -T6 alloy. At all tempera- 
tures up to 480° C, the 8009 alloy has a higher specific 
stiffness than a Ti-6A1-4V alloy. The fatigue and 
creep rupture properties are better than conventional 
aluminium alloys with excellent corrosion resistance. 
These RSP silicide alloys can be readily fabricated 
into sheet, extruded or forged and the combination of 
attractive properties makes them serious candidates for 
aerospace applications. Other alloys developed include 
Al-Cr-Zn-Mn, Al-8Fe-2Mo and Al-Li. 

9. 7.4. 3 Mechanical alloys of aluminium 
Mechanical alloys of aluminium contain dispersions 
of carbides or oxides, which not only produce disper- 
sion strengthening but also stabilize a fine-grain struc- 
ture. An advantage of these alloys arises because the 
strength is derived from the dispersoids and thus the 
composition of the alloy matrix can be designed prin- 
cipally for corrosion resistance and toughness rather 
than strength. Thus the alloying elements which are 
usually added to conventional aluminium alloys for 
precipitation- strengthening and grain size control may 
be unnecessary. 

Mechanical alloying is carried out with elemental 
powders and an organic process control agent, such as 



Figure 9.17 Tensile properties ofRSPAl alloy 8009 as a function of temperature compared with conventional aluminium 
alloys (from Gilman, 1990, p. 505). 
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stearic acid, to balance the cold-welding and powder 
fracture processes. No dispersoid is added because the 
oxide on the surface of the powders and process control 
agent are consolidated during mechanical alloying as 
hydrated oxides and carbonates. The process produces 
a fine dispersion of ~20 nm particles in a dynamically - 
recrystallized structure with grains as fine as 0.05 pm. 
Subsequent vacuum degassing at elevated tempera- 
ture removes the H 2 and N 2 liberated, improves the 
homogenization of the matrix and reduces carbonates 
to AI 4 C 3 which forms most of the dispersoid. The final 
grain size is around 0.1 pm. The powder is then com- 
pacted by HIPing, or vacuum hot-pressing and conven- 
tionally extruded to produce a material with a stable 
grain size of 0.3 pm, with grain boundaries pinned by 
the dispersoid. 

Mechanical alloys have been developed correspond- 
ing to the 2000, 5000 and 7000 aluminium series 
alloy. IN -9021 is heat-treatable by solution treatment 
and natural or elevated-temperature ageing to give 
500-560 MN m~ 2 proof stress, 570-600 MN m 2 TS, 
12% elongation and 40 MN m -3/2 K lc . IN-9052 is 
the equivalent of a 5000 series alloy, requiring no 
heat-treatment and offering good strength in thick 
sections; 390 MN m -2 proof stress, 470 MN m -2 TS, 
13% elongation and 46 MN m _3/2 K\ c . Mechanically 
alloyed Al-Mg-Li offers inherent high strength in 
thick section 430 MN m -2 proof-stress, 500 MN nrT 2 
TS, 10% elongation and 30 MN m _3/2 K\ c . 
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Chapter 10 
Ceramics and glasses 


10.1 Classification of ceramics 

The term ceramic, in its modern context, covers 
an extremely broad range of inorganic materials; 
they contain non-metallic and metallic elements and 
are produced by a wide variety of manufacturing 
techniques. Traditionally, ceramics are moulded from 
silicate minerals, such as clays, dried and fired at tem- 
peratures of 1200-1800°C to give a hard finish. Thus 
we can readily see that the original Greek word ker- 
amos, meaning ‘burned stuff or ‘kiln-fired material’, 
has long been directly appropriate. Modern ceramics, 
however, are often made by processes that do not 
involve a kiln-firing step (e.g. hot-pressing, reaction- 
sintering, glass-devitrification, etc.). Although ceram- 
ics are sometimes said to be non-metallic in character, 
this simple distinction from metals and alloys has 
become increasingly inadequate and arbitrary as new 
ceramics with unusual properties are developed and 
come into use. 

Ceramics may be generally classified, according 
to type or function, in various ways. In industrial 
terms, they may be listed as pottery, heavy clay prod- 
ucts (bricks, earthenware pipes, etc.), refractories (fire- 
bricks, silica, alumina, basic, neutral), cement and 
concrete, glasses and vitreous enamels, and engineer- 
ing (technical, fine) ceramics. Members of the final 
group are capable of very high strength and hardness, 
exceptional chemical stability and can be manufactured 
to very close dimensional tolerances. These will be 
our prime concern. Their introduction as engineering 
components in recent years has been based upon con- 
siderable scientific effort and has revolutionized engi- 
neering design practice. In general, the development 
of engineering ceramics has been stimulated by the 
drive towards higher, more energy-efficient, process 
temperatures and foreseeable shortages of strategic 
minerals. In contrast to traditional ceramics, which 
use naturally-occurring and, inevitably, rather variable 
minerals, the new generation of engineering ceramics 


depends upon the availability of purified and synthe- 
sized materials and upon close micro structural control 
during processing. Ceramics are subject to variability 
in their properties and statistical concepts often need 
to be incorporated into design procedures for stressed 
components. Design must recognize the inherent brit- 
tleness, or low resistance to crack propagation, and 
modify, if necessary, the mode of failure. Ceramics, 
because of their unique properties, show great promise 
as engineering materials but, in practice, their produc- 
tion on a commercial scale in specified forms with 
repeatable properties is often beset with many pro- 
blems. 

Using chemical composition as a basis, it is possible 
to classify ceramics into five main categories: 

1. Oxides — alumina, A1 2 0 3 (spark plug insulators, 
grinding wheel grits), magnesia, MgO (refrac- 
tory linings of furnaces, crucibles), zirconia, 
Zr0 2 (piston caps, refractory lining of glass 
tank furnaces), zirconia/alumina (grinding media), 
spinels, M 2+ 0 .M 2 + 0 3 (ferrites, magnets, transis- 
tors, recording tape), ‘fused’ silica glass (laboratory 
ware), 

2. Carbides — silicon carbide, SiC (chemical plant, 
crucibles, ceramic armour), silicon nitride, Si 3 N 4 
(spouts for molten aluminium, high-temperature 
bearings), boron nitride, BN (crucibles, grinding 
wheels for high-strength steels). 

3. Silicates — porcelain (electrical components), steat- 
ites (insulators), mullite (refractories). 

4. Sialons — based on Si-Al-O-N and M-Si-Al- 
O-N where M = Li, Be, Mg, Ca, Sc, Y, rare earths 
(tool inserts for high-speed cutting, extrusion dies, 
turbine blades). 

5. Glass-ceramics — Pyroceram, Cercor, Pyrosil (re- 
cuperator discs for heat exchangers). 

The preceding two methods of classifying ceramics, 
industrial and chemical, are of very little use to the 
materials scientist and technologist, who is primarily 
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concerned with structure/property relations. One can 
predict that a ceramic structure with a fine grain (crys- 
tal) size and low porosity is likely to offer advantages 
of mechanical strength and impermeability to contact- 
ing fluids. It is therefore scientifically appropriate to 
classify ceramic materials in microstructural terms, in 
the following manner: 

1. Single crystals of appreciable size (e.g. ruby laser 
crystal) 

2. Glass (non-crystalline) of appreciable size (e.g. 
sheets of ‘float’ glass) 

3. Crystalline or glassy filaments (e.g. E-glass for 
glass-reinforced polymers, single-crystal ‘whis- 
kers’, silica glass in Space Shuttle tiles) 

4. Poly crystalline aggregates bonded by a glassy 
matrix (e.g. porcelain pottery, silica refractories, 
hot-pressed silicon nitride) 

5. Glass-free poly crystalline aggregates (e.g. ultra- 
pure, fine-grained, ‘zero-porosity’ forms of alumina, 
magnesia and beryllia) 

6. Poly crystalline aggregates produced by heat- 
treating glasses of special composition (e.g. glass- 
ceramics) 

7. Composites (e.g. silicon carbide or carbon filaments 
in a matrix of glass or glass-ceramic, magnesia- 
graphite refractories, concrete). 

This approach to classifying ceramics places the 
necessary emphasis upon the crystalline and non- 
crystalline (glassy) attributes of the ceramic body, the 
significance of introducing grain boundary surfaces 
and the scope for deliberately mixing two phases with 
very different properties. 


10.2 General properties of ceramics 

The constituent atoms in a ceramic are held together 
by very strong bonding forces which may be ionic, 
covalent or a mixture of the two. As a direct conse- 
quence, their melting points are often very high, mak- 
ing them eminently suited for use in energy-intensive 
systems such as industrial furnaces and gas turbines. 
For instance, alumina primarily owes its importance 
as a furnace refractory material to its melting point of 


2050°C. The type of inter-atomic bonding is respon- 
sible for the relatively low electrical conductivity of 
ceramics. For general applications they are usually 
regarded as excellent electrical insulators, having no 
free electrons. However, ion mobility becomes signif- 
icant at temperatures above 5 00- 600° C and they then 
become progressively more conductive. This property 
can prove a problem in electric furnaces. 

The strength of ceramics under compressive stress- 
ing is excellent; accordingly, designers of ceramic 
artefacts as different as arches in buildings and metal- 
cutting tool tips ensure that the forces during service 
are essentially compressive. In contrast, the tensile 
strength of ceramics is not exceptional, sometimes 
poor, largely because of the weakening effect of sur- 
face flaws. Thus, in some cases, glazing with a thin 
vitreous layer can seal surface cracks and improve 
the tensile strength. The strength of ceramics is com- 
monly expressed as a modulus of rupture (MoR) value, 
obtained from three-point bend tests, because in the 
more conventional type of test with uniaxial loading, as 
used for metals, is difficult to apply with perfect uniax- 
iality; a slight misalignment of the machine grips will 
induce unwanted bending stresses. Ceramics are gen- 
erally regarded as brittle, non-ductile materials, with 
little or no plastic deformation of the microstructure 
either before or at fracture. For this reason, which 
rules out the types of production processes involv- 
ing deformation that are so readily applied to metals 
and polymers, ceramic production frequently centres 
on the manipulation and ultimate bonding together of 
fine powders. The inherent lack of ductility implies that 
ceramics are likely to have a better resistance to slow 
plastic deformation at very high temperatures (creep) 
than metals. 

The modulus of elasticity of ceramics can be excep- 
tionally high (Table 10.1). This modulus expresses 
stiffness, or the amount of stress necessary to pro- 
duce unit elastic strain, and, like strength, is a primary 
design consideration. However, it is the combination 
of low density with this stiffness that makes ceramics 
particularly attractive for structures in which weight 
reduction is a prime consideration. 

In aircraft gas turbines, ceramic blades have long 
been an interesting proposition because, apart from 
reducing the total mass that has to be levitated, they are 


Table 10.1 Specific moduli of various materials 



Modulus of elasticity 
(E/GN m~ 2 ) 

Bulk density 
(p/kg m~ 3 ) 

Specific modulus 
(E/p) 

Alumina 

345 

3800 

0.091 

Glass (crown) 

71 

2600 

0.027 

Aluminium 

71 

2710 

0.026 

Steel (mild) 

210 

7860 

0.027 

Oak (with grain) 

12 

650 

0.019 

Concrete 

14 

2400 

0.006 

Perspex 

3 

1190 

0.003 
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subject to lower centrifugal forces than metallic ver- 
sions. It is therefore common practice to appraise com- 
petitor materials for aircraft in terms of their specific 
moduli, in which the modulus of elasticity is divided 
by density. Ceramics consist largely of elements of low 
atomic mass, hence their bulk density is usually low, 
typically about 2000-4000 kg m -3 . Ceramics such as 
dense alumina accordingly tend to become pre-eminent 
in listings of specific moduli (Table 10.1). 

The strong interatomic bonding means that ceram- 
ics are hard as well as strong. That is, they resist 
penetration by scratching or indentation and are poten- 
tially suited for use as wear-resistant bearings and as 
abrasive particles for metal removal. Generally, impact 
conditions should be avoided. Interestingly, shape can 
influence performance; thus, the curved edges of din- 
ner plates are carefully designed to maximize resis- 
tance to chipping. Although the strength and hardness 
of materials are often related in a relatively simple 
manner, it is unwise to assume that a hard material, 
whether metallic or ceramic, will necessarily prove to 
be wear-resistant. Grinding of ceramics is possible, 
albeit costly. Strength can be enhanced in this way 
but great care is necessary as there is a risk that the 
machining operation will damage, rather than improve, 
the critical surface texture. 

During the consolidation and densification of a 
‘green’ powder compact in a typical firing operation, 
sintering of the particles gradually reduces the amount 
of pore space between contiguous grains. The final 
porosity, by volume, of the fired material ranges from 
30% to nearly zero. Together with grain size, it has 
a direct influence upon the modulus of rupture; thus, 
bone china, because of its finer texture, is twice as 
strong as fired earthenware. Pore spaces, particularly if 
interconnected, also lower the resistance of a ceramic 
structure to penetration by a pervasive fluid such as 
molten slag. On the other hand, deliberate encourage- 
ment of porosity, say 25-30% by volume, is used 
to lower the thermal conductivity of insulating re- 
fractories. 

Ceramics are often already in their highest state 
of oxidation. Not surprisingly, they often exhibit low 
chemical reactivity when exposed to hot oxidizing 
environments. Their refractoriness, or resistance to 
degradation and collapse during service at high tem- 
peratures, stems from the strong interatomic bonding. 
However, operational temperatures are subject to sud- 
den excursions and the resulting steep gradients of tem- 
perature within the ceramic body can give rise to stress 
imbalances. As the ceramic is essentially non-ductile, 
stresses are not relieved by plastic deformation and 
cracking may occur in planes roughly perpendicular 
to the temperature gradient, with portions of ceramic 
becoming detached from the hottest face. The sever- 
ity of this disintegration, known as spalling, depends 
mainly upon thermal expansivity (a) and conductivity 
( k ). Silica has a poor resistance to spalling whereas sili- 
con nitride can withstand being heated to a temperature 
of 1000°C and then quenched in cold water. 


The ability of certain ceramic oxides to exist in 
either crystalline or non-crystalline forms has been 
commented upon previously. Silica and boric oxide 
possess this ability. In glass-ceramics, a metastable 
glass of special composition is shaped while in the 
viscous condition, then heat-treated in order to induce 
nucleation and growth of a fine, completely crystalline 
structure. (This manipulation and exploitation of the 
crystalline and glassy states is also practised with 
metals and polymers.) This glass-forming potential is 
an important aspect of ceramic science. The property 
of transparency to light is normally associated with 
glasses, notably with the varieties based upon silica. 
However, transparency is not confined to glasses and 
single crystals. It is possible to produce some oxides, 
normally regarded as opaque, in transparent, polycrys- 
talline forms (e.g. hot-pressed magnesia). 

So far as sources in the earth’s crust are concerned, 
mineral reserves for ceramic production are relatively 
plentiful. While one might observe that important con- 
stituent elements such as silicon, oxygen and nitrogen 
are outstandingly abundant, it must also be recognized 
that the processes for producing the new ceramics can 
be very costly, demanding resort to highly specialized 
equipment and exacting process control. 


10.3 Production of ceramic powders 

The wide-ranging properties and versatility of mod- 
em engineering ceramics owe much to the ways in 
which they are manufactured. A fine powder is usually 
the starting material, or precursor; advanced ceram- 
ics are mainly produced from powders with a size 
range of 1-10 pm. Electrical properties are extremely 
stmcture-sensitive and there is a strong demand from 
the electronics industry for even finer particles (in the 
nanometre range). The basic purpose of the manufac- 
turing process is to bring particle surfaces together and 
to develop strong interparticle bonds. It follows that 
specific surface area, expressed per unit mass, is of par- 
ticular significance. Characterization of the powder in 
terms of its physical and chemical properties, such as 
size distribution, shape, surface topography, purity and 
reactivity, is an essential preliminary to the actual man- 
ufacturing process. Tolerances and limits are becoming 
more and more exacting. 

The three principal routes for producing high-grade 
powders are based upon solid-state reactions, solution 
and vaporization. The solid-state reaction route, long 
exemplified by the Acheson process for silicon carbide 
(Section 10.4.5.2), involves high temperatures. It is 
used in more refined forms for the production of 
other carbides (TiC, WC), super-conductive oxides 
and silicon nitride. An aggregate is produced and the 
necessary size reduction (comminution) introduces the 
risk of contamination. Furthermore, as has long been 
known in mineral-dressing industries, fine grinding is 
energy-intensive and costly. 
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The Bayer process for converting bauxite into alu- 
mina is a solution-treatment method. In this impor- 
tant process, which will be examined in detail later 
(Section 10.4.1.2), aluminium hydroxide is precipi- 
tated from a caustic solution and then converted to 
alumina by heating. Unfortunately, this calcination has 
a sintering effect and fine grinding of the resultant 
agglomerate is necessary. In the more recent spray- 
drying and spray -roasting techniques, which are widely 
used to produce oxide powders, sprayed droplets of 
concentrated solutions of appropriate salts are rapidly 
heated by a stream of hot gas. Again, there is a risk of 
agglomeration and grinding is often necessary. 

These difficulties, which stem from the inherent 
physical problem of removing all traces of solvent in a 
satisfactory manner, have encouraged development of 
methods based upon a ‘solution-to-gelation’ (sol-gel) 
approach. The three key stages of a typical sol -gel 
process are: 

1. Production of a colloidal suspension or solution 
(sol) (e.g. concentrated solution of metallic salt in 
dilute acid) 

2. Adjustment of pH, addition of a gelling agent, 
evaporation of liquid to produce a gel 

3. Carefully controlled calcination to produce fine 
particles of ceramic. 

Sol- gel methods are applicable to both ceramics and 
glasses and are capable of producing filaments as 
well as powders. One variant involves hydrolysis 
of distillation-purified alkoxides (formed by reacting 
metal oxides with alcohol). The hydroxide particles 
precipitated from the sol are spherical, uniform in 
shape and sub-micron sized. Sintering does not drasti- 
cally change these desirable characteristics. Although 
costs tend to be high and processing times are lengthy, 
sol- gel methods offer an attractive way to produce 
oxide powders, such as alumina, zirconia and titania, 
that will flow, form and sinter readily and give a prod- 
uct with superior properties. Currently, there is great 
interest in vapour phase methods that enable pow- 
ders with a particle size as small as 10-20 nm to 
be produced (e.g. oxides, carbides, nitrides, silicides, 
borides). The high-energy input required for vaporiza- 
tion is provided by electric arcs, plasma jets or laser 
beams. The powder is condensed within a carrier gas 
and then separated from the gas stream by impinge- 
ment filters or electrostatic precipitators. Sometimes, 
in a chemical vapour deposition process (CVD), a thin 
film is condensed directly upon a substrate. 

The manufacture of an advanced ceramic usually 
involves a number of steps, or unit operations. Each 
operation is subject to a number of interacting vari- 
ables (time, temperature, pressure, etc.) and, by having 
a very specific effect upon the developing structure 
(macro- and micro-), makes its individual contribution 
to the final quality of the product. When ductile met- 
als are shaped by plastic deformation, each operation 
stresses the material and is likely to reveal flaws. (For 


instance, the ability of an austenitic stainless steel to 
be cold-drawn to the dimensions of a fine hypodermic 
needle tube is strong evidence of structural integrity.) 
Individual ceramic particles are commonly brittle and 
non-deformable; consequently, manufacturing routes 
usually avoid plastic deformation and there is a greater 
inherent risk that flaws will survive processing without 
becoming visible or causing actual disintegration. The 
final properties of an advanced ceramic are extremely 
sensitive to any form of structural heterogeneity. The 
development of special ceramics and highly-innovative 
production techniques has encouraged greater use of 
non-destructive evaluation (NDE) techniques at key 
points in the manufacturing programme. At the design 
stage, guidelines of the following type are advisedly 
applied to the overall plan of production: 

1. Precursor materials, particularly ultra-fine powders, 
should be scientifically characterized. 

2. Each and every unit operation should be closely 
studied and controlled. 

3. NDE techniques should be carefully integrated 
within the overall scheme of operations. 

10.4 Selected engineering ceramics 

10.4.1 Alumina 

10.4.1.1 General properties and applications of 
alumina 

Alumina is the most widely used of the twenty or so 
oxide ceramics and is often regarded as the historic 
forerunner of modern engineering ceramics. The actual 
content of alumina, reported as A1 2 0 3 , ranges from 
85% to 99.9%, depending upon the demands of the 
application. 

Alumina-based refractories of coarse grain size are 
used in relatively massive forms such as slabs, shapes 
and bricks for furnace construction. Alumina has a 
high melting point (2050° C) and its heat resistance, 
or refractoriness, has long been appreciated by fur- 
nace designers. In fact, there has been a trend for 
aluminosilicate refractories (based upon clays) to be 
replaced by more costly high-alumina materials and 
high-purity alumina. Interatomic bonding forces, partly 
ionic and partly covalent, are extremely strong and the 
crystal structure of alumina is physically stable up to 
temperatures of 1500-1700°C. It is used for protec- 
tive sheaths for temperature-measuring thermocouples 
which have to withstand hot and aggressive environ- 
ments and for filters which remove foreign particles 
and oxide dross from fast-moving streams of molten 
aluminium prior to casting. Large refractory blocks 
cast from fused alumina are used to line furnaces for 
melting glass. However, although alumina is a heat- 
resisting material with useful chemical stability, it is 
more sensitive to thermal shock than silicon carbide 
and silicon nitride. A contributory factor is its rela- 
tively high linear coefficient of thermal expansion (a). 
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The respective a- values/ x 10 -6 K -1 for silicon car- 
bide, silicon nitride and alumina are 8, 4.5 and 3.5. 

When intended for use as engineering components 
at lower temperatures, alumina ceramics usually have a 
fine grain size (0.5-20 pm) and virtually zero porosity. 
Development of alumina to meet increasingly stringent 
demands has taken place continuously over many years 
and has focused mainly upon control of chemical com- 
position and grain structure. The chemical inertness 
of alumina and its biocompatibility with human tissue 
have led to its use for hip prostheses. An oft-quoted 
example of the capabilities of alumina is the insulat- 
ing body of the spark-ignition plug for petrol-fuelled 
engines (Figure 10.1). Its design and fabrication meth- 
ods have been steadily evolving since the early 1900s. 
In modern engines, trouble-free functioning of a plug 
depends primarily upon the insulating capability of 
its isostatically-pressed alumina body. Each plug is 
expected to withstand temperatures up to 1000°C, sud- 
den mechanical pressures, corrosive exhaust gases and 
a potential difference of about 30 kV while ‘firing’ pre- 
cisely 50-100 times per second over long periods of 
time. Plugs are provided with a smooth glazed (glassy) 
surface so that any electrically-conductive film of con- 
tamination can be easily removed. 

The exceptional insulating properties and range of 
alumina ceramics have long been recognized in the 
electrical and electronics industries (e.g. substrates for 
circuitry, sealed packaging for semiconductor micro- 
circuits). Unlike metals, there are no ‘free’ electrons 
available in the structure to form a flow of current. 
The dielectric strength, which is a measure of the abil- 
ity of a material to withstand a gradient of electric 
potential without breakdown or discharge, is very high. 
Even at temperatures approaching 1000°C, when the 
atoms tend to become mobile and transport some elec- 
trical charge, the resistivity is still significantly high. 
Electrical properties usually benefit when the purity of 
alumina is improved. 

Many mass-produced engineering components take 
advantage of the excellent compressive strength, hard- 
ness and wear resistance of alumina (e.g. rotating 
seals in washing machines and in water pumps for 
automobile engines, machine jigs and cutting tools, 
soil-penetrating coulters on agricultural equipment, 
shaft bearings in watches and tape-recording machines, 
guides for fast-moving fibres and yams, grinding 
abrasives). (Emery, the well-known abrasive, is an 
impure anhydrous form of alumina which contains 
as much as 20% Si0 2 + Fe 2 0 3 ; pretreatment is often 
unnecessary.) The constituent atoms in alumina, alu- 
minium and oxygen, are of relatively low mass and the 
correspondingly low density (3800 kg m~ 3 ) is often 
advantageous. However, like most ceramics, alumina 
is brittle and should not be subjected to either impact 
blows or excessive tensile stresses during service. 

Alumina components are frequently quite small but 
their functioning can vitally affect the performance and 
overall efficiency of a much larger engineering system. 



Figure 10.1 Spark plug for petrol engine ( with 
acknowledgements to Champion Spark Plug Division of 
Cooper GB Ltd). 

Spark-plug insulators 1 and water-pump sealing rings in 
internal combustion engines are striking examples of 
this principle at work. 

10.4.1.2 Preparation and shaping of alumina 
powders 

Examination of the general form of the production 
route for alumina ceramics from ore to finished shape 
provides an insight into some of the important factors 
and working principles which guide the ceramics tech- 
nologist and an indication of the specialized shaping 
methods that are available for ceramics. As mentioned 
earlier, each stage of the production sequence makes 
its own individual and vital contribution to the final 
quality of the product and must be carefully controlled. 

The principal raw material for alumina production 
is bauxite A1 2 0(0H) 4 , an abundant hydrated rock 
occurring as large deposits in various parts of the 

! Over the period 1902-1977 Robert Bosch Ltd developed 
more than 20000 different types of spark plug. 
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world. 2 In the Bayer process, prepared bauxitic ore 
is digested under pressure in a hot aqueous solution 
of sodium hydroxide and then ‘seeded’ to induce pre- 
cipitation of Al(OH) 3 crystals, usually referred to by 
the mineral term ‘gibbsite’. (The conditions of time, 
temperature, agitation, etc. during this stage greatly 
influence the quality of the Bayer product.) Gibbsite 
is chemically decomposed by heating (calcined) at a 
temperature of 1200°C. Bayer calcine, which consists 
of o'-alumina (>99% A1 2 0 3 ), is graded according to 
the nature and amount of impurities. Sodium oxide, 
Na 2 0, ranges up to 0.6% and is of special signifi- 
cance because it affects sintering behaviour and elec- 
trical resistance. The calcine consists of agglomerates 
of o'-alumina crystallites which can be varied in aver- 
age size from 0.5 to 100 pm by careful selection of 
calcining conditions. 

Bayer calcine is commonly used by manufactur- 
ers to produce high-purity alumina components as 
well as numerous varieties of lower-grade components 
containing 85-95% A1 2 0 3 . For the latter group, the 
composition of the calcine is debased by additions 
of oxides such as Si0 2 , CaO and MgO which act as 
fluxes, forming a fluid glassy phase between the grains 
of o'-alumina during sintering. 

The chosen grade of alumina, together with any 
necessary additives, is ground in wet ball-mills to 
a specified size range. Water is removed by spray- 
ing the aqueous suspension into a flow of hot gas 
(spray-drying) and separating the alumina in a cyclone 
unit. The free-flowing powder can be shaped by a 
variety of methods (e.g. dry, isostatic-or hot-pressing, 
slip- or tape-casting, roll-forming, extrusion, injection- 
moulding). Extremely high production rates are often 
possible; for instance, a machine using air pres- 
sure to compress dry powder isostatically in flexible 
rubber moulds (‘bags’) can produce 300-400 spark 
plug bodies per hour. In some processes, binders 
are incorporated with the powder; for instance, a 
thermoplastic can be hot-mixed with alumina powder 
to facilitate injection-moulding and later burned off. In 
tape-casting, which produces thin substrates for micro- 
electronic circuits, alumina powder is suspended in an 
organic liquid. 

10.4.1.3 Densification by sintering 
The fragile and porous ‘green’ shapes are finally fired 
in kilns (continuous or intermittent). Firing is a costly 
process and, wherever possible, there has been a natu- 
ral tendency to reduce the length of the time cycle for 
small components. Faster rates of cooling after ‘soak- 
ing’ at the maximum temperature have been found to 
give a finer, more desirable grain structure. 

2 Long-distance transportation costs have prompted 
investigation of alternative sources. For instance, roasted 
kaolinite can be leached in concentrated hydrochloric or 
sulphuric acid, then precipitated as an aluminium salt which 
is calcined to form alumina. 


It has been mentioned that fluxing oxides are added 
to lower-grade aluminas in order to form an intergranu- 
lar phase(s). Although this fluid inter-granular material 
facilitates densification during firing, its presence in 
the final product can have a detrimental effect upon 
strength and resistance to chemical attack. As a con- 
sequence, powders of high alumina content are chosen 
for demanding applications. In general, an increase in 
alumina content from 88% to 99.8% requires a corre- 
sponding increase in firing temperature from 1450°C to 
1750°C. ‘Harder’ firing incurs heavier energy costs and 
has led to the development of reactive alumina which 
has an extremely small particle size (1 pm) and a large 
specific surface. ‘Softer’ firing temperatures became 
possible with this grade of alumina and the need to 
debase the alumina with relatively large amounts of 
additives was challenged. 

Shrinkage is the most apparent physical change 
to take place when a ‘green’ ceramic compact is 
fired. The linear shrinkage of alumina is about 20% 
and dimensions may vary by up to ±1%. Diamond 
machining is used when greater precision is needed 
but requires care as it may damage the surface and 
introduce weakening flaws. 

10.4.2 From silicon nitride to sialons 

10.4.2.1 Reaction-bonded silicon nitride 
(RBSN) 

Silicon nitride, which can be produced in several ways, 
has found application under a variety of difficult condi- 
tions (e.g. cutting tools, bearings, heat engines, foundry 
equipment, furnace parts, welding jigs, metal-working 
dies, etc.). Its original development was largely stimu- 
lated by the search for improved materials for gas tur- 
bines. Prior to its development in the 1950s, the choice 
of fabrication techniques for ceramics was restricted 
and it was difficult to produce complex ceramic shapes 
to close dimensional tolerances. The properties avail- 
able from existing materials were variable and specific 
service requirements, such as good resistance to ther- 
mal shock and attack by molten metal and/or slag, 
could not be met. The development of silicon nitride 
minimized these problems; it has also had a profound 
effect upon engineering thought and practice. 

Silicon nitride exists in two crystalline forms (a, /3): 
both belong to the hexagonal system. Bonding is pre- 
dominantly covalent. Silicon nitride was first produced 
by an innovative form of pressureless sintering. First, a 
fragile pre-form of silicon powder (mainly o'-Si 3 N 4 ) is 
prepared, using one of a wide variety of forming meth- 
ods (e.g die-pressing, isostatic-pressing, slip-casting, 
flame- spraying, polymer-assisted injection-moulding, 
extrusion). In the first stage of a reaction-bonding pro- 
cess, this pre-form is heated in a nitrogen atmosphere 
and the following chemical reaction takes place: 

3Si + 2N 2 = Si 3 N 4 
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A reticular network of reaction product forms through- 
out the mass, bonding the particles together with- 
out liquefaction. Single crystal ‘whiskers’ of o'- silicon 
nitride also nucleate and grow into pore space. Reac- 
tion is strongly exothermic and close temperature con- 
trol is necessary in order to prevent degradation of 
the silicon. The resultant nitrided compact is strong 
enough to withstand conventional machining. In the 
second and final stage of nitridation, the component 
is heated in nitrogen at a temperature of 1400°C, 
forming more silicon nitride in situ and producing a 
slight additional change in dimensions of less than 1%. 
(Alumina articles can change by nearly 10% during 
firing.) The final micro structure consists of a-Si 3 N 4 
(60-90%), /3- Si 3 N 4 (10-40%), unreacted silicon and 
porosity (15-30%). As with most ceramics, firing is 
the most costly stage of production. 

The final product, reaction-bonded silicon nitride 
(RBSN), has a bulk density of 2400-2600 kg m -3 . 
It is strong, hard and has excellent resistance to wear, 
thermal shock and attack by many destructive fluids 
(molten salts, slags, aluminium, lead, tin, zinc, etc.). 
Its modulus of elasticity is high. 

10.4.2.2 Hot-pressed forms of silicon nitride 
(HPSN, HIPSN) 

In the early 1960s, a greater degree of densification was 
achieved with the successful production of hot-pressed 
silicon nitride (HPSN) by G. G. Deeley and co-workers 
at the Plessey Co. UK. Silicon nitride powder, which 
cannot be consolidated by solid-state sintering alone, 
is mixed with one or more fluxing oxides (magne- 
sia, yttria, alumina) and compressed at a pressure of 
23 MN m -2 within radio-frequency induction-heated 
graphite dies at temperatures up to 1850°C for about 
1 h. The thin film of silica that is usually present on sil- 
icon nitride particles combines with the additive(s) and 
forms a molten phase. Densification and mass trans- 
port then take place at the high temperature in a typical 
‘liquid-phase’ sintering process. As this intergranular 
phase cools, it forms a siliceous glass which can be 
encouraged to crystallize (devitrify) by slow cooling 
or by separate heat-treatment. This HP route deliber- 
ately produces a limited amount of second phase (up to 
3% v/v) as a means of bonding the refractory particles; 
however, this bonding phase has different properties to 
silicon nitride and can have a weakening effect, particu- 
larly if service temperatures are high. Thus, with 3-5% 
added magnesia, at temperatures below the softening 
point of the residual glassy phase, say 1000°C, silicon 
nitride behaves as a brittle and stiff material; at higher 
temperatures, there is a fairly abrupt loss in strength, 
as expressed by modulus of rupture (MoR) values, and 
slow deformation under stress (creep) becomes evident. 
For these reasons, controlled modification of the struc- 
ture of the inter-granular residual phase is of particular 
scientific concern. 

Yttria has been used as an alternative densifier to 
magnesia. Its general effect is to raise the softening 


point of intergranular phase significantly. More specifi- 
cally, it yields crystalline oxynitrides (e.g. Y 2 Si 3 0 3 N 4 ) 
which dissolve impurities (e.g. CaO) and form refrac- 
tory solid solutions (‘mixed crystals’). Unfortunately, 
at high temperatures, yttria-containing silicon nitride 
has a tendency to oxidize in a catastrophic and disrup- 
tive manner. 

Although the use of dies places a restriction upon 
component shape, hot-pressing increases the bulk den- 
sity and improves strength and corrosion resistance. 
The combination of strength and a low coefficient 
of thermal expansion (approximately 3.2 x 10 -6 °C _1 
over the range 25-1000°C) in hot-pressed silicon 
nitride confer excellent resistance to thermal shock. 
Small samples of HPSN are capable of surviving 100 
thermal cycles in which immersion in molten steel 
(1600°C) alternates with quenching into water. 

In a later phase of development, other researchers 
used hot isostatic-pressing (HIPing) to increase density 
further and to produce much more consistent proper- 
ties. Silicon nitride powder, again used as the starting 
material, together with a relatively small amount of the 
oxide additive(s) that promote liquid-phase sintering, is 
formed into a compact. This compact is encapsulated 
in glass (silica or borosilicate). The capsule is evac- 
uated at a high temperature, sealed and then HIPed, 
with gas as the pressurizing medium, at pressures up 
to 300 MN m -2 for a period of 1 h. Finally, the glass 
envelope is removed from the isotropic HIPSN compo- 
nent by sand-blasting. Like HPSN, its micro structure 
consists of /3-Si 3 N 4 (>90%) and a small amount of 
intergranular residue (mainly siliceous glass). 

Production routes involving deformation at very high 
temperatures and pressures, as used for HPSN and 
HIPSN, bring about a desirable closure of pores but 
inevitably cause a very substantial amount of shrinkage 
(20-30%). (In contrast to HPSN and HIPSN, RBSN 
undergoes negligible shrinkage during sintering at the 
lower process temperature of 1400°C and accordingly 
contains much weakening porosity, say 15-30% v/v.) 
By the early 1970s, considerable progress had been 
made in producing silicon nitride by reaction-bonding, 
hot-pressing and other routes. However, by then it had 
become evident that further significant improvements 
in the quality and capabilities of silicon nitride were 
unlikely. At this juncture, attention shifted to the sialons. 

10.4.2.3 Scientific basis of sialons 
Although silicon nitride possesses extremely useful 
properties, its engineering exploitation has been ham- 
pered by the difficulty of producing it in a fully dense 
form to precise dimensional tolerances. Hot-pressing 
offers one way to surmount the problem but it is a 
costly process and necessarily limited to simple shapes. 
The development of sialons provided an attractive and 
feasible solution to these problems. 

Sialons are derivatives of silicon nitride and are 
accordingly also classified as nitrogen ceramics. The 
acronym ‘sialon’ signifies that the material is based 
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upon the Si-Al-O-N system. In 1968, on the basis of 
structural analyses of silicon nitrides, it was predicted 1 
that replacement of nitrogen (N 3- ) by oxygen (O 2- ) 
was a promising possibility if silicon (Si 4+ ) in the 
tetrahedral network could be replaced by aluminium 
(Al 3+ ), or by some other substituent of valency lower 
than silicon. Furthermore, it was also predicted that 
systematic replacement of silicon by aluminium would 
allow other types of metallic cation to be accommo- 
dated in the structure. Such replacement within the 
SiN 4 structural units of silicon nitride would make 
it possible to simulate the highly versatile manner in 
which Si0 4 and A10 4 tetrahedra arrange themselves in 
aluminosilicates. A similarly wide range of structures 
and properties was anticipated for this new family of 
ceramic ‘alloys’. About two years after the vital pre- 
diction, British and Japanese groups, acting indepen- 
dently, produced ^'-silicon nitride, the solid solution 
which was to be the prototype of the sialon family. 

In /3-silicon nitride, the precursor, SiN 4 tetrahedra 
form a network structure. Each tetrahedron has a cen- 
tral Si 4+ which is surrounded by four equidistant N 3- 
(Figure 10.2). Each of these comer N 3- is common to 



Figure 10.2 The crystal structure of f-Si3N 4 and 
f'-(Si,Al)3(0,N) 4 0 metal atom, O non-metal atom ( from 
Jack, 1987, pp. 259-88; reprinted by permission of the 
American Ceramic Society). 

x By K. H. Jack and colleagues at the University of 
Newcastle-upon-Tyne; separate British and Japanese groups 
filed patents for producing sialons in the early 1970s. The 
writings of K. H. Jack on silicon nitride and sialons provide 
an insight into the complexities of developing a new 
engineering material. 


three tetrahedra. In the unit cell, six Si 4+ ions balance 
the electrical charge of eight N 3- , giving a starting for- 
mula Si 6 N 8 . Replacement of Si 4+ and N 3- by Al 3+ and 
O 2- , respectively, forms a /T-sialon stmcture which 
is customarily represented by the chemical formula 
Si 6 _ z Al z O z N 8 _ z , where z = number of nitrogen atoms 
replaced by oxygen atoms. The term z ranges in value 
from 0 to 4. Although considerable solid solution in 
silicon nitride is possible, the degree of replacement 
sought in practice is often quite small. With replace- 
ment, the formula for the tetrahedral unit changes from 
SiN 4 to (Si, Al) (O, N) 4 and the dimensions of the unit 
cell increase. 

Although replacement causes the chemical compo- 
sition to shift towards that of alumina, the structural 
coordination in the solid solution is fourfold (A10 4 ) 
whereas in alumina it is sixfold (A106). The strength 
of the Al-0 bond in a sialon is therefore about 50% 
stronger than its counterpart in alumina; this concentra- 
tion of bonding forces between aluminium and oxygen 
ions makes a sialon intrinsically stronger than alumina. 

The problem of representing complex phase rela- 
tionships in a convenient form was solved by adopt- 
ing the ‘double reciprocal’ diagram, a type of phase 
diagram originally developed for inorganic salt sys- 
tems by German physical chemists many years ago. 
Figure 10.3 shows how a tetrahedron for the four ele- 
ments Si, Al, O and N provides a symmetrical frame 
of reference for four compounds. By using linear 
scales calibrated in equivalent % (rather than the usual 
weight, or atomic %), each compound appears mid- 
way on a tetrahedral edge and the resulting section is 
square. An isothermal version of this type of diagram 


O 



% equivalent N 

Figure 10.3 Relation between Si-Al-O-N tetrahedron and 
square Si 3 -AI 4 -AI 4 N 4 -Si 3 N 4 plane. 
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Figure 10.4 Si-Al-O-N behaviour diagram at 1800° C 
( from Jack, 1987, pp. 259-88; reprinted by permission of 
the American Ceramic Society). 


is shown in Figure 10.4. The ‘double reciprocal’ char- 
acteristic refers to the equivalent interplay of N/O and 
Si/Al along the vertical and horizontal axes, respec- 
tively. It is necessarily assumed that the valency of the 
four elements is fixed (i.e. Si 4+ , Al 3+ , 0 2 ~ and N 3 ~). 
As the formula for the component Si 3 N 4 contains 12 
cations and 12 anions, the formulae for the other three 
components and for the various intermediate phases 
along the axes are expressed in the forms which give 
a similar charge balance (e.g. Si 3 0 6 rather than Si0 2 ). 
The equivalent % of a given element in these formulae 
can be derived from the following equations: 

Equivalent % oxygen 

100(atomic %0 x 2) 

(atomic %0 x 2) + (atomic %N x 3) 
Equivalent % nitrogen 

= 100% — equivalent % oxygen 
Equivalent % aluminium 

100 (atomic %A1 x 3) 

(atomic %A1 x 3) + (atomic %Si x 4) 
Equivalent % silicon 

= 100% — equivalent %A1 

Thus the intermediate phase labelled 3/2 (Si 2 N 2 0) 
contains 25 equivalent % oxygen and is located one 
quarter of the distance up the left-hand vertical scale. 

An interesting feature of the diagram is the parallel 
sequence of phases near the aluminium nitride corner 
(i.e. 27R, 21R, 12H, 15R and 8H). They are referred 
to as aluminium nitride ‘polytypoids’, or ‘polytypes’. 
They have crystal structures that follow the pattern 
of wurtzite (hexagonal ZnS) and are generally stable, 
refractory and oxidation-resistant. 


10.4.2.4 Production of sialons 
The start point for sialon production from silicon 
nitride powder at a temperature of 1800°C will lie 
in the vicinity of the bottom left-hand corner of 
Figure 10.4. Simultaneous replacement of N with O 
and Si with A1 produces the desired yd’ -phase which 
is represented by the narrow diagonal zone project- 
ing towards the A1 4 0 6 comer. Such ‘alloying’ of 
the ceramic structure produces progressive and subtle 
changes in the stmcture of silicon nitride by altering 
the balance between covalent and ionic bonding forces. 
The resultant properties can be exceptional. Impor- 
tantly, the oxidation resistance and strength of sialons 
at temperatures above 1000°C are greatly superior to 
those of conventional silicon nitride. Relatively sim- 
ple fabrication procedures, similar to those used for 
oxide ceramics, can be adopted. Pressureless-sintering 
enables dense complex shapes of moderate size to be 
produced. 

yd-Si 3 N 4 powder is the principal constituent of the 
starting mixture for ‘alloying’. (As mentioned previ- 
ously, these particles usually carry a thin layer of 
silica.) Although fine aluminium nitride would appear 
to be an appropriate source of replacement aluminium, 
it readily hydrolyses, making it impracticable to use 
fabrication routes which involve aqueous solutions or 
binders. One patented method for producing a yd' -sialon 
(z = 1) solves this problem by reacting the silicon 
nitride (and its associated silica) with a specially- 
prepared ‘polytypoid’. The phase relations for this 
method are shown in Figure 10.4. 

An addition of yttrium oxide to the mixture 
causes an intergranular liquid phase to form during 
pressureless-sintering and encourage densification. By 
controlling conditions, it is possible to induce this 
phase either to form a glass or to crystallize 
(devitrify). In sialons, as in many other ceramics, 
the final character of the intergranular phase has 
a great influence upon high-temperature strength. A 
structure of ft grains + glass is strong and resists 
thermal shock at temperatures approaching 1000°C. 
However, at higher temperatures the glassy phase 
deforms in a viscous manner and strength suffers. 
Improved stability and strength can be achieved by a 
closely-controlled heat-treatment which transforms the 
glassy phase into crystals of yttrium-aluminium-gamet 
(YAG), as represented in the following equation: 

Si 5 A10N 7 + Y-Si-Al-O-N 

yd' -sialon Oxynitride 

(z = 1) glass 

Sis+xAli _ x Oi_ x N 7+x + y 3 ai 5 o 12 

Modified YAG 

yd'-sialon 

The two-phase structure of yd' grains + YAG is 
extremely stable. It does not degrade in the presence of 
molten metals and maintains strength and creep resis- 
tance up to a temperature of 1400°C. 
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More recent work has led to the production of 
sialons from precursors other than /3-silicon nitride 
(e.g. a'-sialons from a-silicon nitride and O'-sialons 
from oxynitrides). K. H. Jack proposed that o'-silicon 
nitride, unlike the /3-form, is not a binary compound 
and should be regarded as an oxynitride, a defect struc- 
ture showing limited replacement of nitrogen by oxy- 
gen. The formula for its structural unit approximates 
to SiN 3 . 9 O 0 . 1 - Dual-phase or composite structures have 
also been developed in which paired combinations of 
/3'-, a'- and O'- phases provide enhancement of engi- 
neering properties. Sometimes, as in a' /ft composites, 
there is no glassy or crystalline intergranular phase. 
The sialon principle can be extended to some unusual 
natural waste materials. For instance, two siliceous 
materials, volcanic ash and burnt rice husks, have each 
been used in sinter mixes to produce sialons. Although 
such products are low grade, it has been proposed that 
they could find use as melt-resistant refractories. 

10.4.2.5 Engineering applications of sialons 
The relative ease with which sialons can be shaped is 
one of their outstanding characteristics. Viable shap- 
ing techniques include pressing (uniaxial, isostatic), 
extrusion, slip-casting and injection-moulding; their 
variety has been a great stimulus to the search for 
novel engineering applications. Similarly, their ability 
to densify fully during sintering at temperatures in the 
order of 1800°C, without need of pressure application, 
favours the production of complex shapes. However, 
due allowance must be made for the large amount of 
linear shrinkage (20-25%) which occurs as a result 
of liquid phase formation during sintering. Although 
final machining with diamond grit, ultrasonic energy 
or laser beam energy is possible, the very high hard- 
ness of sialons encourages adoption of a near-net- shape 
approach to design. As with many other engineering 
ceramics, sialon components are extremely sensitive 
to shape and it is generally appreciated that a change 
in curvature or section can frequently improve service 
performance. The structure of a sialon is, of course, the 
main determinant of its properties. Fortunately, sialons 
are very responsive to ‘alloying’ and combinations of 
attributes such as strength, stability at high tempera- 
tures, resistance to thermal shock, mechanical wear and 
molten metals can be developed in order to withstand 
onerous working conditions. 

During metal-machining, tool tips are subjected 
to highly destructive and complex conditions which 
include high local temperatures and thermal shock, 
high stresses and impact loading, and degradation by 
wear. At a test temperature of 1000°C, the indentation 
hardness of /f -sialon (+ glass) is much greater than 
that of either alumina or cobalt-bonded tungsten car- 
bide (Figure 10.5). The introduction of tool tips made 
from this sialon was a notable success. They were 
found to have a longer edge life than conventional 
tungsten carbide inserts, could remove metal at high 
speed with large depths of cut and could tolerate the 
shocks, mechanical and thermal, of interrupted cutting. 



Figure 10.5 Hot hardness of sialon, alumina and WC/Co 
cutting tool tips (from Jack, 1987, pp. 259-88; reprinted by 
permission of the American Ceramic Society). 

The strength and wear resistance of sialons led to 
their use in the metal-working operations of extrusion 
(hot- and cold-) and tube-drawing. In each process, 
the relative movement of the metal stock through the 
die aperture should be fast with low friction and mini- 
mal die wear, producing closely dimensioned bar/tube 
with a smooth and sound surface texture. Sialon die 
inserts have been successfully used for both fer- 
rous and non-ferrous metals and alloys, challenging 
the long-established use of tungsten carbide inserts. 
Sialons have also been used for the plugs (captive 
or floating) which control bore size during certain 
tube-drawing operations. It appears that the absence 
of metallic microconstituents in sialons obviates the 
risk of momentary adhesion or ‘pick-up’ between dies 
and/or plugs and the metal being shaped. Sialon tools 
have made it possible to reduce the problems normally 
associated with the drawing of difficult alloys such as 
stainless steels. 

The endurance of sialons at high temperatures and 
in the presence of invasive molten metal or slag has 
led to their use as furnace and crucible refractories. 
On a smaller scale, sialons have been used for com- 
ponents in electrical machines for welding (e.g. gas 
shrouds, locating pins for the workpiece). These appli- 
cations can demand resistance to thermal shock and 
wear, electrical insulation, great strength as well as 
immunity to attack by molten metal spatter. Sialons 
have proved superior to previous materials (alumina, 
hardened steel) and have greatly extended the service 
life of these small but vital machine components. 
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The search for greater efficiency in automotive 
engines, petrol and diesel, has focused attention on 
regions of the engine that are subjected to the most 
severe conditions of heat and wear. Sialons have 
been adopted for pre-combustion chambers in indi- 
rect diesel engines. Replacement of metal with ceramic 
also improves the power/weight ratio. It is still possible 
that the original goal of researches on silicon nitride 
and sialons, the ceramic gas turbine, will eventually be 
achieved. 

10.4.3 Zirconia 

Zirconium oxide (Zr0 2 ) has a very high melting point 
(2680° C), chemical durability and is hard and strong; 
because of these properties, it has long been used for 
refractory containers and as an abrasive medium. At 
temperatures above 1200°C, it becomes electrically 
conductive and is used for heating elements in furnaces 
operating with oxidizing atmospheres. Zirconia-based 
materials have similar thermal expansion characteris- 
tics to metallic alloys and can be usefully integrated 
with metallic components in heat engines. In addition 
to these established applications, it has been found 
practicable to harness the structural transitions of zir- 
conia, thereby reducing notch- sensitivity and raising 
fracture toughness values into the 15-20 MN m~ 3/2 
band, thus providing a new class of toughened ceram- 
ics. This approach is an alternative to increasing the 
toughness of a ceramic by either (1) adding filaments 
or (2) introducing microcracks that will blunt the tip 
of a propagating crack. 

Zirconia is polymorphic, existing in three crystalline 
forms; their interrelation, in order of decreasing tem- 
perature, is as follows: 

Melt , Cubic Tetragonal 

2680°C c 2370°C t 

950°C 

- ^ Monoclinic 

1150°C m 

The technique of transformation- toughening hinges 
upon stabilizing the high-temperature tetragonal (t) 
form so that it is metastable at room temperature. 
Stabilization, partial or whole, is achieved by adding 
certain oxides (Y 2 0 3 , MgO, CaO) to zirconia. In 
the metastable condition, the surrounding structure 
opposes the expansive transition from t- to m-forms. 
In the event of a propagating crack passing into or 
near metastable regions, the concentrated stress field 
at the crack tip enables t-crystals of zirconia-rich 
solid solution to transform into stable, but less dense, 
m-Zr0 2 (Figure 10.6). The transformation is marten- 
sitic in character. The associated volumetric expan- 
sion (3-5% v/v) tends to close the crack and relieve 
stresses at its tip. This transformation mechanism is 
primarily responsible for the beneficial toughening 
effect of a metastable phase within the micro structure. 

The relative stability of zirconia-rich solid solutions 
can be conveniently expressed in terms of the phase 



Figure 10.6 Crack propagating into grains of t-zirconia, 
causing them to transform into m- zirconia. 



Figure 10.7 Schematic phase diagram for ZrC >2 -T 2 O 3 
system: all phases depicted are solid solutions. TZP = 
tetragonal zirconia polycrystal, PSZ = partially -stabilized 
zirconia, CSZ = cubic -stabilized zirconia. 


diagram for the zirconia-rich end of the Zr0 2 -Y 2 0 3 
system (Figure 10.7). The same principles apply in 
a very general sense to the other two binary sys- 
tems, Zr0 2 -MgO and Zr0 2 -CaO. Yttria is partic- 
ularly effective as a stabilizer. Three zirconia-based 
types of ceramic have been superimposed upon the 
diagram; CSZ, TZP and PSZ. The term CSZ refers 
to material with a fully- stabilized cubic (not tetrag- 
onal) crystal structure which cannot take advantage 
of the toughening transformation. It is used for fur- 
nace refractories and crucibles. The version known 
as tetragonal zirconia polycrystal (TZP) contains the 
least amount of oxide additive (e.g. 2-4 mol% Y 2 0 3 ) 
and is produced in a fine-grained form by sinter- 
ing and densifying ultra-fine powder in the tem- 
perature range 1350-1500°C; such temperatures are 
well within the phase field for the tetragonal solid 
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solution (Figure 10.7). After cooling to room temper- 
ature, the structure is essentially single-phase, consist- 
ing of very fine grains (~0.2-l pm) of t-Zr0 2 which 
make this material several times stronger than other 
types of zirconia-toughened ceramics. A typical TZP 
microstructure, as revealed by electron microscopy, 
is shown in Figure 10.8. Added oxide(s) and sili- 
cate impurities form an intergranular phase which can 
promote liquid-phase sintering during consolidation. 
(A similar effect is utilized in the production of silicon 
nitride.) 

In partially-stabilized zirconia (PSZ), small t-crys- 
tals are dispersed as a precipitate throughout a matrix 
of coarser cubic grains. Zirconia is mixed with 8-10 
mol% additive (MgO, CaO or Y 2 0 3 ) and heat- 
treated in two stages (Figure 10.7). Sintering in the 
temperature range 1650-1850°C produces a parent 
solid solution with a cubic structure which is then 
modified by heating in the range 1100-1450°C. This 
second treatment induces a precipitation of coherent 
t-crystals (~200 nm in size) within the c-grains. The 
morphology of the precipitate depends upon the nature 
of the added solute (e.g. Zr0 2 -MgO, Zr0 2 -CaO and 
Zr0 2 -Y 2 0 3 solid solutions produce lenticular, cuboid 
and platey crystals, respectively). The average size of 
precipitate crystals is determined by the conditions of 
temperature and time adopted during heat-treatment in 
the crucial ‘t + c’ field of the phase diagram. 

In the third example of transformation- toughening, 
t- zirconia grains are dispersed in a dissimilar ceramic 
matrix; for example, in ZT(A) or ZT(A1 2 0 3 ) they 
are dispersed among alumina grains (Figure 10.9). 
An intergranular distribution of the metastable phase 
results when conventional processing methods are used 
but it has also been found possible to produce an 
intragranular distribution. As with PSZ materials, the 
size of metastable particles and matrix grains must be 
carefully controlled and balanced. 



Figure 10.8 Electron micrograph of tetragonal-zirconia 
poly crystal stabilized with 3 mol. % yttria ( with 
acknowledgement to M. G. Cain, Centre for Advanced 
Materials Technology, University of Warwick, UK). 
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Figure 10.9 Duplex structure ofZT ( A/ 2 0 3 ) consisting of 
alumina and t-zirconia grains ( back-scattered electron 
image), (from Green, 1984, p. 84; by permission of Marcel 
Dekker Inc.). 

So far, we have concentrated upon mechanical 
behaviour at or below ambient temperature. If the tem- 
perature of a zirconia-toughened material is raised to 
900-1000°C, which is close to the t-m transition tem- 
perature, the toughening mechanism tends to become 
ineffective. In addition, thermal cycling in service 
tends to induce the t-m transition at temperatures in the 
range 800-900°C and the toughening property is grad- 
ually lost. This tendency for fracture toughness to fall 
as the service temperature increases has naturally led to 
the investigation of alternative forms of stabilization in 
systems which have much higher transformation tem- 
peratures (e.g. ZT(Hf0 2 )). Intergranular residues (e.g. 
in TZP), despite their beneficial effect during sintering, 
become easier to deform as the temperature rises and 
the material then suffers loss of strength and resistance 
to creep. 

10.4.4 Glass-ceramics 

10.4.4.1 Controlled devitrification of a glass 
It has long been appreciated that crystallization can 
take place in conventional glassy structures, particu- 
larly when they are heated. However, such crystalliza- 
tion is initiated at relatively few sites and there is a 
tendency for crystals to grow perpendicular to the free 
surface of the glass in a preferred manner. The result- 
ing structure, being coarsely crystalline and strongly 
oriented, is mechanically weak and finds no practical 
application. 
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The basic principle of glass-ceramic production is 
that certain compositions of glass respond to con- 
trolled heat-treatment and can be converted, without 
distortion and with little dimensional change, from 
a readily- shaped glass into a fine-grained crystalline 
ceramic possessing useful engineering properties. The 
key to this structural transformation, which takes place 
throughout the bulk of the glass (volume crystalliza- 
tion), is the presence of a nucleating agent, or catalyst, 
in the original formulation. 

Controlled devitrification of a special glass involves 
two or more stages of heat-treatment (Figure 10.10). 
In the first stage, which can begin while the glass is 
cooling from the forming and shaping operation, hold- 
ing at a specific temperature for a definite time period 
causes the catalyst to initiate the precipitation of large 
numbers of nuclei throughout the glassy matrix. When 
these seed regions reach a certain size, different species 
of crystals may begin to grow upon them. Electron 
microscopy has demonstrated that epitaxial relation- 
ships exist between the first-formed crystals and suc- 
ceeding generations of crystals (Figure 10.11). Finally, 
in the second stage of heat- treatment (Figure 10.10), 
the structure is heated to a different temperature in 
order to induce further crystallization, crystal growth, 
crystal transitions and a gradual, almost complete, dis- 
appearance of the glassy matrix. 

Control of time and temperature is essential dur- 
ing the production by heat-treatment of a glass- 
ceramic. Figure 10.12 provides a general guide to the 
temperature-dependence of nucleation and growth pro- 
cesses for any melt, irrespective of whether it forms 
a glassy or crystalline solid. Curve N represents the 
rate of homogeneous nucleation; that is, the number 
of nuclei forming per second in each unit volume of 
glass. Curve G represents the rate of crystal growth 
(micron/second). Each curve has a peak value; for 
viscous glass-forming melts, these maxima are not 
very pronounced. With regard to the formation of a 



Figure 10.10 Temperature/time schedule for producing a 
glass-ceramic. 



Figure 10.11 Epitaxial growth of lithium metasilicate 
( LS = L^SiOs ) on a lithium orthophosphate seed crystal 
( LP = LP 3 PO 4 ) in SiC >2 —Li 2 O—AI 2 O 3 glass containing 
P 2 O 5 catalyst (from Headley and Loehmann, 1984, 
pp. 620-25; reprinted by permission of the American 
Ceramic Society). 
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Figure 10.12 Temperature-dependence of nucleation and 
growth processes (after Raw son, 1980). 

glass-ceramic, it is of prime importance to select a tem- 
perature which is close to the peak of the nucleation 
curve and then, in the second stage of heat-treatment, a 
temperature which does not encourage excessive grain 
growth. Usually the temperature chosen for the second 
stage is higher than that used for the first. A care- 
ful balance of conditions during heat-treatment will 
favour the production of the desired ultra-fine grain 
structure. Thus the rate of nucleation should be high, 
nuclei should be uniformly dispersed and the rate of 
crystal growth should not be excessive. Crystals are 
one micron or less in size; interlocking of these crystals 
will enhance the mechanical strength. 
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Fundamental studies are complicated by the fact 
that the detailed mechanisms by which nuclei form 
in a homogeneous glassy matrix, and then develop 
into crystals, appear to be specific to each type 
of glass. However, although spinodal decomposition 
is sometimes possible, it is usually regarded as a 
nucleation and growth process. Although the exact 
nature of the early stages of nucleation is highly 
debatable, the nuclei, once formed, enable hetero- 
geneous nucleation of the major crystalline phases 
to take place. Metastable phases may form during 
heat-treatment; such phases do not feature in phase 
(equilibrium) diagrams. Well-known crystalline phases 
may appear at unexpected temperatures. Nominally 
metastable phases may prove to be quite stable under 
service conditions. Furthermore, successful composi- 
tions are usually multi-component in character and 
modifying oxides, particularly the catalyst, can have 
a significant effect upon the types of crystal produced 
and upon their transformation processes. For a given 
catalyst, a change in the heat-treatment process can 
result in a change in the major crystalline phase(s). 

Nucleation in glasses which use an oxide as catalyst 
often appears to be preceded by a process of separation 
into two solid glassy phases (metastable immiscibility). 
These microphases differ in chemical composition and 
are therefore believed to have a desirable effect upon 
the ultimate grain size by favouring a high nucleation 
density and reducing the growth rate of crystals. Phase 
separation may occur during either cooling of the melt 
or reheating and it is logical to presume that it is 
more likely to take place when two network-formers 
are present in the glass formulation. The production of 
Vycor , a Si 02 -B 2 03 -Na 2 0 glass, also takes advantage 
of phase separation. 

10.4.4.2 Development of glass-ceramics 
Glass-ceramics date from the late 1950s and are an 
offshoot of research by S. D. Stookey at the Com- 
ing Glass Works, USA, on photosensitive glasses. In 
the normal procedure for these glasses, after prior 
irradiation with ultraviolet light, their structure can 
be altered by heat-treatment. Metals, such as cop- 
per, silver or gold, act as nucleating agents for lim- 
ited crystallization. It was fortuitously discovered that 
higher heat-treatment temperatures could induce com- 
plete crystallization. Subsequent research at Corn- 
ing established that certain oxides could also act as 
effective nucleating agents and led to development 
of the first glass-ceramics, which were based on the 
Si 0 2 -Al 2 03 -Li 0 2 system. Conveniently, these mate- 
rials do not require prior irradiation. Oxides which 
promote crystallization include titania, zirconia and 
phosphorus pentoxide. 

Practical considerations of production greatly 
influence the development and exploitation of glass- 
ceramics. As a general rule, the temperatures of 
melting and refining should not exceed 1400-1500°C. 
Fortunately, some nucleating agents, such as titania, 


have a strong fluxing action on silica. In addition, melt 
viscosity is important; alumina increases viscosity and 
will tend to slow down melting and refining operations. 
As composition control is a vital feature of glass- 
ceramics, it is essential to maintain melt composition 
reproducibly from batch to batch. Volatilization and 
interaction between the melt and the refractory lining 
of the melting furnace can make this difficult. For 
instance, high proportions of lithium oxide in the melt 
will increase attack on the lining. The melt leaving the 
melting furnace has a temperature-dependent viscosity 
of 10 13 poise and the cooling mass can be worked 
and shaped until its viscosity falls to about 10 8 poise. 
Although conventional glasses have a useful ‘long’ 
range of temperature over which they can be worked, 
the composition of potential glass-ceramics restricts 
(‘shortens’) this range, particularly when aluminium 
oxide is present. As a consequence, the choice of 
shaping process may be restricted to gravity or 
centrifugal casting. 

As metastability is an essential feature of a glass- 
ceramic, it is not surprising to find that certain 
compositions tend to devitrify prematurely during 
working. Oxides of aluminium, phosphorus and the 
alkali metals sodium and potassium inhibit devitrifica- 
tion in Si0 2 -Li0 2 glasses. On the other hand, the glass 
should crystallise neither too quickly (during cooling of 
the melt) nor too slowly (during heat- treatment). These 
tendencies can be eliminated by adding oxides which 
have a specific effect upon the strength of the glass net- 
work structure. For instance, lithium oxide introduces 
non-bridging oxygen ions into a network of Si 04 tetra- 
hedra and, by weakening it, favours crystallization. 

10.4.4.3 Typical applications of glass-ceramics 
The versatility and potential for development of glass- 
ceramics quickly led to their adoption in heat engines, 
chemical plant, electronic circuits, seals, cladding for 
buildings, aerospace equipment, nuclear engineering, 
etc. They can offer a remarkable combination of prop- 
erties. For instance, glass-ceramic hob plates for elec- 
tric cookers are strong, smooth and easy to clean, stable 
over long periods of heating, transparent to infrared 
radiation from the tungsten halogen lamp, relatively 
opaque to visible light (reducing glare) and resistant to 
thermal shock. The last property originates primarily 
from the low thermal coefficient of thermal expan- 
sion ( a ) of this particular glass-ceramic. In the ver- 
satile Si0 2 -Al 2 0 3 -Li0 2 class of glass-ceramics, the 
o'- value can be ‘tailored’ from zero to 12 x l 0 - 6 ° c - 1 
by controlling structure. Thus, in types containing 
about 10% alumina, crystals of lithium disilicate 
and quartz (or cristobalite) form to give a relatively 
high expansion coefficient. Increasing the alumina to 
about 20% favours the formation of two types of 
lithium aluminium silicate crystals, /3-spodumene and 
/3-eucryptite. Over the temperature range 20-1000°C, 
the o'- values for these two compounds are 0.9 x 
10 -6o C -1 and — 6.4 x 10 _6o C _1 , respectively. Careful 
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balancing of their relative amounts against the residual 
glass content can reduce the overall expansion coeffi- 
cient towards zero. This capability is ideal for the large 
mirror blanks used for telescopes where dimensional 
stability is essential. In heat engines such as lorry gas 
turbines, a low expansion coefficient is required for the 
regenerative heat exchanger which is alternately heated 
and cooled by exhaust gases and combustion air. In 
glass/glass and metal/glass seals, precise matching of 
expansion characteristics is possible. In the mass pro- 
duction of colour television tubes, devitrifiable solder 
glass-ceramics based upon the Pb0-Zn0-B 2 0 3 sys- 
tem have a fairly high a value and have been used to 
seal the glass cone to the glass face plate at a relatively 
low temperature without risk of distortion. The seal is 
subsequently heated to form the glass-ceramic. 

A fine-grained structure of interlocking crystals 
favours mechanical strength; modulus of rupture val- 
ues are comparable to those for dense alumina. In a 
machinable variety of glass-ceramic, interlocking crys- 
tals of platey mica deflect or blunt forming cracks 
and make it possible for complex machinable com- 
ponents to be designed. Chemical stability, as well 
as wear-resistance, is essential when service involves 
contact with fluids (e.g. valves, pumps, vessel linings). 
It is well-known that the fluxing oxides of sodium 
and potassium lower the chemical resistance of sil- 
ica glass to aqueous solutions; in glass-ceramics, this 
susceptibility is countered by stabilizing any residual 
glass phase with boric oxide. Attempts are being made 
to extend their use to much higher temperatures (e.g. 
1300-1700°C) by exploiting refractory systems, such 
as Si0 2 -Al 2 0 3 -Ba0, which can provide a liquidus 
temperature above 1750°C but such glasses are very 
difficult to melt. 

Conversion from glass to crystals causes mobile ions 
to disappear from the structure and become ‘bound’ 
to crystals, consequently the electrical resistivity and 
dielectric breakdown strength of glass-ceramics are 
high, even at temperatures up to 5 00 - 700° C. The 
dielectric loss is low. 

Utilization of plentiful and cheap waste byproducts 
from industry for the bulk production of glass-ceramics 
has stimulated much interest. Although the chemical 
complexity of materials such as metallurgical slags 
makes it difficult to control and complete crystalliza- 
tion, strong and wear-resistant products for architecture 
and road surfacing have been produced (e.g. Slagce- 
ram, Slagsitall). 

10.4.5 Silicon carbide 

10.4.5.1 Structure and properties of silicon 
carbide 

The two principal structural forms of this synthetic 
compound are a-SiC (non-cubic; hexagonal, rhombo- 
hedral) and /3-SiC (cubic). The cubic /3-form begins 
to transform to a-SiC when the temperature is raised 
above 2100°C. Conditions of manufacture determine 
the exact crystal structure and a number of variants 


have been identified by X-ray diffraction analysis. 
For example, many different stacking sequences are 
possible in hexagonal a-SiC (e.g. 4H, 6H). The nomen- 
clature for such variants (polytypes) indicates the num- 
ber of atomic layers in the stacking sequence and the 
crystal system (e.g. 15R, 3C). Tetrahedral grouping of 
carbon atoms around a central silicon atom is a com- 
mon and basic feature of these various structures. This 
covalent bonding of two tetravalent elements gives 
exceptional strength and hardness and a very high 
melting point (>2700°C). 

Despite its carbon content, silicon carbide offers 
useful resistance to oxidation. At elevated tempera- 
tures, a thin impervious layer of silica (cristobalite) 
forms on the grains of carbide. On cooling, the a/ p 
transition occurs in cristobalite and it can crack, allow- 
ing ingress of oxygen. Above a temperature of 1500°C, 
the silica layer is no longer protective and the carbide 
degrades, forming SiO and CO. 

For service applications where resistance to ther- 
mal shock is important, it is customary to compare 
candidate ceramic materials in terms of a parame- 
ter which allows for the effect of relevant properties. 
Many versions of this parameter have been proposed. 
A typical parameter ( R ) for sudden thermal shock is 
R = <j(1 — v)/Ea , where a is the modulus of rupture 
(N m~ 2 ), v is the Poisson ratio (0.24 for REFEL SiC), 
E is the modulus of elasticity (N m -2 ), and a is the 
linear coefficient of thermal expansion (K -1 ). In the 
case of silicon carbide, the product in the denomi- 
nator tends to be high, giving a low index. Silicon 
nitride gives a higher index and it is understandable 
that Si 3 N 4 -bonded silicon carbide is preferred when 
service involves thermal cycling. The severity of shock 
can affect the rating of different materials. Thus, for 
less rapid shock, a thermal conductivity term ( k ) is 
included as a multiplier in the numerator of the above 
parameter. 

10.4.5.2 Production of silicon carbide powder 
and products 

Silicon carbide is a relatively costly material because 
its production is energy-intensive. The Acheson carbo- 
thermic process, which is the principal source of com- 
mercial-quality silicon carbide, requires 6-12 kWh 
per kg of silicon carbide. Locations served by hydro- 
electric power, such as Norway and the Niagara Falls 
in Canada, are therefore favoured for synthesis plant. 
In this unique process, a charge of pure silica sand 
(quartz), petroleum coke (or anthracite coal), sodium 
chloride and sawdust is packed around a 15 m long 
graphite conductor. A heavy electric current is passed 
through the conductor and develops a temperature in 
excess of 2600°C. The salt converts impurities into 
volatile chlorides and the sawdust provides connected 
porosity within the charge, allowing gases/vapours to 
escape. The essential reaction is: 

Si0 2 + 3C 


or-SiC + 2CO 
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The reaction product from around the electrode is 
ground and graded according to size and purity. The 
colour can range from green (99.8% SiC) to grey (90% 
SiC). 

Numerous chemical conversion and gas-phase syn- 
thesis processes have been investigated. The temper- 
atures involved are generally much lower than those 
developed in the Acheson process; consequently, they 
yield the cubic /3-form of silicon carbide. Chemical 
vapour deposition (CVD) has been used to produce 
filaments and ultra-fine powders of /3-SiC. 

Silicon carbide shapes for general refractory appli- 
cations are produced by firing a mixture of SiC grains 
and clay at a temperature of 1500°C. The resultant 
bond forms mullite and a glassy phase, absorbing the 
thin layer of silica which encases the grains. Other 
bonding media include ethyl silicate, silica and sili- 
con nitride. The latter is developed in situ by firing a 
SiC/Si compact in a nitrogen atmosphere. This parti- 
cular bond is strong at high temperatures and helps to 
improve thermal shock resistance. 

Specialized and often costly processing methods are 
used to produce fine dense ceramics for demanding 
engineering applications. The methods available for 
forming silicon carbide powders include dry-pressing, 
HIPing, slip-casting, extrusion and injection-moulding. 
The last process requires an expendable polymeric 
binder and is particularly attractive in cases where long 
production runs of complex shapes are envisaged (e.g. 
automotive applications). With regard to firing, the 
main methods are akin to those developed for silicon 
nitride; each, in its own way, is intended to maximize 
the quality of interparticle bonding. They include hot- 
pressing (HP SiC), pressureless-sintering (S SiC) and 
reaction- sintering (Si SiC). 

The hot-pressing method for producing a-SiC 
blanks of high density was originally developed by the 
Norton Co., USA. A small amount of additive (boron 
carbide (B 4 C) or a mixture of alumina and aluminium) 
plays a key role while the carbide grains are being 
heated (>2000°C) and compressed in induction-heated 
graphite dies. It has been suggested in the case 
of HP SiC (and S SiC) that the boron encourages 
grain boundary/surface diffusion and that the carbon 
breaks down the silica layers which contaminate grain 
surfaces. These additives leave an intergranular residue 
which determines the high-temperature service ceiling. 
The hot-pressed blanks usually require mechanical 
finishing (e.g. diamond machining). Production of 
complex shapes by hot-pressing is therefore expensive. 

The pressureless-sintering route (for S SiC) uses 
extremely fine silicon carbide powders of low oxygen 
content. Again, an additive is necessary (B 4 C or alu- 
minium + carbon) in order to promote densification. 
The mixture is cold-pressed and then fired at approxi- 
mately 2000° C in an inert atmosphere. 

The REFEL process for producing siliconized sili- 
con carbide (Si SiC) was developed by the UKAEA 


and is an example of reaction- sintering (reaction- 
bonding). A mixture of a-SiC, graphite and a plas- 
ticizing binder is compacted and shaped by extrusion, 
pressing, etc. This ‘green’ compact can be machined, 
after which the binder is removed by heating in an 
oven. The pre-form is then immersed in molten silicon 
under vacuum at a temperature of 1700°C. Graphitic 
carbon and silicon react to form a strong intergran- 
ular bond of /3-SiC. A substantial amount of ‘excess’ 
silicon (say, 8-12%) remains in the structure; the max- 
imum operating temperature is thus set by the melting 
point of silicon (i.e. 1400°C. Beyond this temperature 
there is a rapid fall in strength. Ideally, neither unre- 
acted graphite nor unfilled voids should be present in 
the final structure. The dimensional changes associated 
with the HP process are small and close tolerances can 
be achieved; the shrinkage of 1-2% is largely due to 
the bake-out of the binder. 

10.4.5.3 Applications of silicon carbide 
Silicon carbide has been the subject of continuous 
development since it was first produced by the then- 
remarkable Acheson process in 1891. 1 Now it is avail- 
able in a wide variety of forms which range from 
monolithics to single-crystal filaments (whiskers). It 
is used for metal-machining, refractories and heating 
elements in furnaces, chemical plant, heat exchangers, 
heat engines, etc. 

The extreme hardness (2500-2800 kgf mm -2 ; 
Knoop indenter) of its particles and their ability 
to retain their cutting edges at high contact 
temperatures (circa 1000°C) quickly established silicon 
carbide grits as important grinding media. The 
comparatively high cost of silicon carbide refractories 
is generally justified by their outstanding high- 
temperature strength, chemical inertness, abrasion 
resistance and high thermal conductivity. In the New 
Jersey process for producing high-purity zinc, SiC is 
used for components such as distillation retorts, trays 
and the rotating condensation impellers which have to 
withstand the action of molten zinc and zinc vapour. 
In iron-making, silicon carbide has been used to line 
the water-cooled bosh and stack zones of iron-smelting 
blast furnaces, where its high thermal conductivity and 
abrasion-resistance are very relevant. However, it can 
be attacked by certain molten slags, particularly those 
rich in iron oxides. This characteristic is illustrated 
by experience with the skid rails which support steel 
billets in reheating furnaces. This type of furnace 
operates at a temperature of 1250°C. Water-cooled 
steel rails have traditionally been used but warp, 
wear rapidly and tend to form ‘cold spots’ where 

Carborundum Co. Ltd: The Americans E. G. Acheson and 
W. A. McCallister gave the name Carborundum to their 
new material, assuming that it was a combination of carbon 
and corundum (AI 2 O 3 ). Even after its true chemical identity 
was established, Acheson retained the name, regarding it as 
‘phonetic, of pleasing effect in print, even though a trifle 
lengthy’ . 
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they contact the billets. Replacement with uncooled 
silicon carbide rails solved these problems but it was 
found that iron oxide scale from the billets could 
melt and attack silicon carbide. Some improvement 
was achieved by flush-mounting the rails in the 
furnace floor. 

Silicon carbide also has a key role in recent designs 
of radiant tube heaters in gas-fired furnaces. The com- 
bination of a 60 kW recuperative burner and a radi- 
ant tube (1.4 m long x 170 mm diameter) made from 
Si 3 N 4 -bonded silicon carbide is shown in Figure 10.13. 
This British Gas design allows outgoing combustion 
products to preheat incoming air, giving high thermal 
efficiency, and also keeps these gases separate from the 
atmosphere within the furnace chamber. The maximum 
surface temperature for the radiant tube is 1350°C. 

Silicon carbide is electrically conductive and care 
has to be taken when it is used as a refractory in the 
structure of electrometallurgical plant. However, the 
combination of electrical conductivity and refractori- 
ness offers special advantages. For example, silicon 
carbide resistor elements have been used since about 
1930 in indirect resistance-heated furnaces throughout 
industry (e.g. Globars). These elements act as energy- 
conversion devices, heating the furnace charge by radi- 
ation and convection. They can operate in air and inert 
gas atmospheres at temperatures up to 1650°C but 
certain conditions can shorten their life (e.g. carbon 
pick-up from hydrocarbon gases, oxidation by water 
vapour). Service life also tends to decrease as the oper- 
ating temperature is increased. Provided that service 
conditions are not too severe, a life of at least 10000 h 
can be anticipated. A double-helical heating section 
is available as an alternative to the standard cylindri- 
cal shape and allows both electrical connections to be 


made at the same end. Elements should be of reason- 
able diameter/length and be neither too fragile nor too 
massive. 

The presence of impurity atoms in silicon carbide 
enables electron flow to take place; it is accordingly 
classed as an extrinsic semiconductor. The electrical 
resistance of silicon carbide is very temperature- 
dependent, decreasing from room temperature to 
650°C and then slowly increasing with further rise 
in temperature. Because of this characteristic and 
the great sensitivity of cold-resistance to traces of 
impurities, a typical production procedure is to check 
the resistance of each element (in air) with an electrical 
load per cm 2 of radiating surface which is equivalent 
to a typical operating surface temperature (e.g. 
15.5 W cm' 2 and 1070°C). This nominal resistance 
is then used to calculate the number and size of 
elements required. As the element ages, its resistance 
slowly increases. A constant rate of energy input to the 
furnace is maintained by increasing the voltage applied 
across the elements (e.g. by multi-tap transformer). 

Specialized forms of silicon carbide now find 
widespread use in engineering. At ambient temper- 
atures, they serve in machine components subjected 
to abrasive wear (e.g. mechanical seals, bearings, 
slurry pump impellers, wire dies, fibre spinnerets). 
In high-temperature engineering, silicon carbide is 
now regarded, together with silicon nitride and the 
sialons, as a leading candidate material for service 
in heat engine designs which involve operation at 
temperatures in excess of 1000°C (e.g. glow plugs, 
turbocharger rotors, turbine blades and vanes, rocket 
nozzles). Glow plugs minimize the hazards of ‘flame- 
out’ in the gas turbine engines of aircraft. Their func- 
tion is to reignite the fuel/air mixture. They must 
withstand considerable thermal shock; for instance, 



Figure 10.13 Operating principle of the ceramic radiant tube (from Wedge , Jan 1987, pp. 36-8; by courtesy of the Institute of 
Materials). 
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on engine start-up the temperature rises from ambi- 
ent to 1600°C in 20 s and falls to 900°C in less than 
1 s if flame-out occurs. Si 3 N 4 -bonded silicon carbide 
performed well in this application. Large-scale uti- 
lization in gas-turbine and diesel engines has been 
greatly inhibited by the inherent brittleness of silicon 
carbide. Addition of a second phase to the structure, 
the composite approach, is regarded as a likely way 
to solve this problem of low fracture toughness. In a 
more general sense, it is accepted that the methodology 
and practice of non-destructive evaluation (NDE) and 
proof-testing for ceramic components require further 
refinement. 

10.4.6 Carbon 

10.4.6.1 The versatility of carbon 

Lying in the central zone of the Periodic Table between 
metallic and non-metallic elements, carbon has been 
aptly described as the chameleon among materials. 
Over the years it has become available in numerous 
forms yet still retains its capacity to surprise. The 
recent discovery and production of an exciting and 
completely new structural form, buckminsterfullerene, 
illustrates this point. Carbon is customarily classed as a 
ceramic because of its long-established use as a refrac- 
tory: present-day use of clay/graphite crucibles as con- 
tainers for molten metals echoes its use for the same 
purpose in the sixteenth century. Its high-temperature 
capabilities are exceptional. Although carbon is the 
traditional fuel for combustion and, ideally, should 
only be used as a heat-resistant engineering material in 
atmospheres of low oxygen content, the rate at which 
it is wasted by oxidation is very often acceptable to 
industry and compares favourably with that of many 
metals. 

Materials science tends to emphasize the two con- 
trasting crystalline forms (allotropes) of carbon, dia- 
mond and graphite, particularly as they represent 
extremes of hardness and softness. Both forms occur 
naturally and can be synthesized. Industry has long 
recognized their unique properties and has successfully 
developed an extremely wide and potentially confus- 
ing range of specialized carbon products. Within this 
range of structurally-different products, the degree of 
crystallinity may range from highly-developed to min- 
imal. The following categories of carbon product will 
now be examined in order to illustrate the general 
significance of structure and the ways in which it 
can be manipulated: natural and synthetic diamonds, 
baked and graphitized carbons, pyrolytic graphite, vit- 
reous carbon, intercalation compounds, buckminster- 
fullerene. 

10.4.6.2 Natural diamond 

The cubic form of carbon is renowned for its 
hardness, strength and beauty. The carbon-carbon 
bond lengths are all the same (0.1555 nm) so that any 
sections taken through the tetrahedrally coordinated, 
highly symmetrical structure cut a large number of 


these strong bonds; accordingly, there are no planes 
of easy cleavage. In commercial terms, there are 
three main qualities of mined diamonds: gemstones, 
industrial stones and boart (bort, bortz). Rough 
stones of gemstone quality are an exclusively natural 
product and their appearance is enhanced by a 
highly-demanding cutting procedure which involves 
cleavage on meticulously- selected planes, such as the 
{111}, and sawing/grinding/polishing with a mixture 
of diamond paste and olive oil. During cutting at 
least half of the mass is lost. 1 The symmetrical 
faceted shapes produced by the lapidary comply with 
standard mathematically-substantiated patterns (cuts) 
which optimise the optical effects of internal reflection 
(‘life’) and refraction (‘fire’). Three cuts currently 
favoured are brilliant, emerald or baguette. 

Single industrial stones of near-gem quality are used 
as tools for trueing and dressing grinding wheels, 
engraving, rock-drilling and as dies for drawing wires 
of copper, steel, tungsten, etc. In the important pro- 
cess of trueing and dressing, a tool holding a single 
diamond is held against the working face of the grind- 
ing wheel as it rotates. The wheel profile is corrected 
and grit particles are left projecting slightly above the 
bonding matrix. Natural single crystals are used for 
drawing the smaller sizes of wire (e.g. <0.2 mm). 
These dies have a crystallographically-oriented aper- 
ture which is typically capable of passing 10000 km 
of wire before dimensional tolerances are exceeded. 

Diamond indenters are used in macrohardness and 
microhardness testing machines (Section 5. 2.2.4). Sur- 
face textures of machined metals are commonly char- 
acterized by profilometers which traverse a sampling 
length with a fine diamond stylus; interpretation of the 
resultant high-resolution trace should be tempered by 
the observation that the moving stylus tends to plough 
a furrow in the test surface. 

Boart is generally <1.5 mm in size, badly flawed 
and imperfectly crystallized. For many years, boart was 
virtually unsaleable. Then, in the 1930s, its use as an 
abrasive grit or powder for impregnating the working 
faces of grinding wheels was fostered. Thereafter it 
found increasing application on a large scale for saw- 
ing, drilling and machining operations and became an 
accepted abrasive medium for non-ferrous metals and 
alloys, hard carbides, concrete, rock, glass, polymers, 
etc. The working face of a bonded grinding wheel that 
is being propelled into the workpiece is a compos- 
ite structure, consisting of abrasive particles of grit 
(diamond, cobalt-bonded tungsten carbide, alumina or 
silicon carbide) set in a bonding matrix. The matrix 
can be sintered metal, electroplate, vitreous or resinoid. 
Selection of the best combination of grit and bond can 
draw from a large pool of practical experience and 
broadly depends upon the material being machined 
and upon the machining conditions. It is reasonable 

x For diamonds, the carat (ct) is a unit of mass, with 1 ct = 
0.2 g: for alloyed gold, the carat is one twenty-fourth part 
by mass of gold. 
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to regard each particle of diamond grit as a single cut- 
ting tool. The tendency of some qualities of diamond 
to fracture and regenerate new cutting edges, rather 
than be tom out of the matrix, can be advantageous in 
certain machining operations. The high thermal con- 
ductivity of diamond (which is greater than that of 
copper) and low coefficient of friction minimize the 
generation and dissipation of heat from each of a myr- 
iad of such tools. Characteristics such as these have 
encouraged the trend toward higher rates of feed and 
greater peripheral wheel speeds, to the benefit of pro- 
ductivity, dimensional accuracy and surface texture. 
In simple terms, a higher cutting speed means that 
each particle is subjected to stress and heat effects for 
shorter periods of time. (The availability of synthetic 
grits has aided this particular development in grinding 
practice.) 

Lapping and polishing operations require diamond 
powders; under these fine-scale cutting operations, heat 
generation is not usually a problem. In the metallo- 
graphic polishing of metallic and ceramic specimens, 
a progression from coarse to sub-micron powders 
smoothes the surface as well as gradually reducing the 
depth of unwanted surface distortion. 

10.4.6.3 Synthetic diamond 

The quest for a method to synthesize diamonds from 
carbonaceous material dates back to the nineteenth 
century. After World War II, research was stimulated 
by political uncertainties in Africa that threatened to 
jeopardize supplies of boart. The first announcement of 
the successful synthesis of diamonds from graphite was 
made by the General Electric Company, USA, in 1955. 
(Later, it transpired that the ASEA, Stockholm, Swe- 
den had achieved a comparable result two years earlier 
but had not publicized it.) These remarkable methods 
simultaneously subjected graphite to extremely high 
static pressures and high temperatures. 1 

The physical conditions necessary for synthesis are 
customarily discussed in terms of the type of phase 
diagram shown in Figure 10.14. The key feature is 
the ascending Berman-Simon line 2 representing equi- 
librium between diamond and graphite. Diamond is 
stable above this line and graphite is stable below it. 
Diamond is able, of course, to exist below the line 
in a metastable condition at ordinary pressures; how- 
ever, it was found experimentally that heating above 
a temperature of 1800 K caused rearrangement of its 
extremely strong C-C bonds and transformation into 
graphite. From the diagram it may be deduced that 
the reverse transition, from graphite to diamond, will 
take place above the line at similar temperatures if a 

^uch was owed to the pioneering work of the Nobel Prize 
winner P. W. Bridgman (1882-1961) at Harvard 
University, USA, on methods for developing ultra-high 
pressures. 

2 So named in recognition of the theoretical contribution of 
two Oxford physicists, R. Berman and F. Simon. 



Figure 10.14 Pressure versus temperature diagram for 
carbon ( after Bovenkerk et al., 1959, pp. 1094-8). 

pressure of at least 60 kb is applied simultaneously. 
The GEC method achieved these conditions. It also 
relies upon an addition of a metal (e.g. Ni, Cr, Mn, 
Fe, Co) which acts as a molten solvent for carbon 
and a catalyst for diamond crystallization and growth. 
The liquidus line for the eutectic mixture of carbon 
and nickel is superimposed on the diagram in order 
to define the diamond- growing region (shaded). Early 
synthetic crystals were usually grown under conditions 
of temperature and pressure well above the Berman- 
Simon line and were consequently weak and friable, 
containing stacking faults and metallic inclusions. As 
improved methods for measuring process temperature 
and pressure became available, experience showed that 
synthesis in the shaded region just above the Berman- 
Simon line gave slower and more controllable growth. 

Later GEC experiments at pressures up to 200 kb 
transformed well-crystallized graphite into small 
diamond crystallites, about 0.1 mm in size, with a 
wurtzite-type structure showing hexagonal symmetry. 
Subsequently, natural ‘hexagonal’ diamonds 3 (in 
association with cubic diamonds) were identified in 
meteorites; presumably they formed on impact with 
the earth. Small ‘hexagonal’ diamonds have also been 
synthesized by explosive shock-loading techniques 
that are capable of developing pressures as high as 
500-1000 kb. These conditions only apply over a 
period of a few microseconds and thus tend to restrict 
a physical transition which is time-dependent. 

Synthesis is now practised worldwide and is capa- 
ble of producing diamonds which meet precise physi- 
cal and chemical requirements. Mined diamonds are 
inevitably more variable in quality. The maximum 
size of synthetic diamonds is in the order of 1 mm. 
Demand for industrial diamonds far exceeds that for 

3 Called lonsdaleite in honour of the eminent 
crystallographer, Professor Dame Kathleen Lonsdale 
(1903-1971). 
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gemstones; approximately 85-90% of industrial dia- 
monds are synthesized and are mainly consumed in 
abrasion processes. Synthetic and natural diamond 
abrasives compete with silicon carbide and alumina. 
As mentioned elsewhere, diamond machining is a final 
operation in the production of many of the new engi- 
neering ceramics. 

10.4.6.4 Scientific classification of diamonds 
The structural imperfections to be found in diamonds 
include crystalline inclusions, cracks, impurity atoms 
and vacant sites. Broadly, inclusions and cracks act as 
stress-concentrators and mainly affect the mechanical 
properties, whereas fine-scale defects, notably impu- 
rity atoms of nitrogen and boron, influence physi- 
cal properties such as optical absorption, electrical 
conductivity, etc. Inclusions in natural diamonds are 
particles of mineral matter. Inclusions in synthetic dia- 
monds derive from the metals used as catalysts (i.e. Ni, 
Co, Fe). 

The generally- accepted classification of diamonds, 
which is shown in Table 10.2, recognizes four main 
categories. It is based upon absorption characteristics 
determined over the ultraviolet, visible and infrared 
regions of the electromagnetic spectrum. The choice 
of this approach is perhaps not surprising when one 
considers the visible response of cut diamonds to white 
light. Absorption spectra are highly structure- sensitive 
and have made it possible to classify different qualities 
of natural and synthetic diamond in terms of their 
content of fine defects, such as impurity atoms and 
vacant sites. Each type of defect provides so-called 
‘optical centres’ which decide the specific manner in 
which components of incident radiation are absorbed 
and/or transmitted by the crystal structure. A diamond 
frequently contains more than one type of defect, hence 
interpretation of absorption spectra can sometimes be 
difficult and rather arbitrary. 

The decrease in intensity for a particular wave- 
length, as a result of absorption, is expressed by 
the classic exponential relation I = / O e -o; *, where I 
is the intensity of transmitted radiation, 7 0 is the 
intensity of incident monochromatic radiation and 
a is the coefficient of absorption, x is the path 
length. Long path lengths within the diamond increase 
the amount of optical interaction, producing a size 


effect which explains why larger diamonds tend to be 
more colourful. 

It will be seen that the classification first dis- 
tinguishes between Type I diamonds which contain 
nitrogen (say, up to 0.1 -0.2%) and Type II diamonds 
which have an extremely low nitrogen content. Type 
I diamonds are further sub-divided according to the 
spatial distribution of nitrogen atoms. In natural dia- 
monds, geologic periods of time at high temperatures 
and pressure have permitted nitrogen atoms to cluster, 
sometimes, apparently, into platelets. In most synthetic 
diamonds, the nitrogen atoms are more dispersed. 
Nitrogen, singly or in groups, can occupy substitu- 
tional or interstitial sites and are responsible for at least 
five types of optical centre (e.g. A, B, N, N3, platelet). 
(Thus, the designation Type IaB signifies that the dia- 
mond contains B centres wherein a few nitrogen atoms 
have clustered and replaced carbon atoms.) 

From the distinction between Type Ha and Type lib 
diamonds, it will be seen that the ability of diamond 
to act as either an insulator or a semiconductor is 
governed by its impurity content. In Type II diamonds, 
the concentration of impurity atoms is smaller than in 
Type I diamonds, usually being expressed in parts per 
million. In terms of the electron theory of conduction, 
pure diamond has a filled valence band that is separated 
from a partly filled conduction band by a substantial 
energy gap of 5.5 eV. The mean available thermal 
energy is approximately 0.025 eV and is therefore 
insufficient to transport electrons through the gap. Pure 
diamond is consequently an electrical insulator at room 
temperature. In a similar sense, photons of white light 
have associated wavelengths ranging from 400 nm 
(= 3.1 eV) to 730 nm (= 1.7 eV) and pass through 
the crystal without bringing about electronic transitions 
which bridge the gap. Absorption of energy from these 
incident photons is small. However, photons associated 
with shorter wavelengths in the ultraviolet range can 
exceed the energy requirement of 5.5 eV and the gap 
is bridged. 

Moving on from the special case of pure dia- 
mond to Type I diamonds, the introduction of nitrogen 
atoms (Z = 7) into the carbon structure (Z = 6) allows 
tetrahedral bonding to be maintained but also adds 
extra electrons. Optical absorption now becomes likely 
because electrons can move to higher levels within the 
band gap. However, the presence of nitrogen atoms 


Table 10.2 Classification of diamonds 


Type I ( nitrogen present ) 


Type II ( negligible nitrogen content ) 

Type la 

Type lb 

Type Ha 

Type lib 

Clustering of N atoms 

N atoms dispersed and 
substituting for 


Contain boron 


C atoms 

Non-conducting 

Semiconductivity possible in 
doped synthetic diamonds 

Most natural stones 

Most synthetic 

diamonds; rarely natural stones 

Rarely found in nature 
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does not confer conductivity. In Type lib diamonds, the 
principal impurity is boron (Z = 5). Measured atomic 
concentrations of boron are often less than 0.5 x 10 -6 . 
Each boron atom can contribute only three electrons to 
the surrounding four carbon atoms and therefore pro- 
vides an acceptor site within the band gap, allowing 
‘holes’ to form in the valence band. Semiconductiv- 
ity then becomes possible. The two impurity elements 
nitrogen and boron thus have contrasting effects upon 
the diamond structure, providing donor and acceptor 
sites, respectively. When both elements are present, 
some degree of compensation occurs. For instance, a 
surfeit of nitrogen can neutralize the effect of boron 
atoms and the semiconducting characteristic is absent. 
If a diamond is colourless and electrically conductive 
one might reasonably infer that it contains traces of 
nitrogen and uncompensated boron and is classifiable 
as Type lib. 

Specific types of diamond have proved successful 
in highly specialized applications. For instance, Type 
la and Type Ha diamonds have very high thermal con- 
ductivities (1000 and 2000 W m -1 , K _1 , respectively, 
at room temperature) and have been used as heat sinks 
in electronic devices that release much thermal energy 
within a confined space. Detector ‘windows’ of Type II 
diamond are virtually transparent to infrared radiation 
and have been used in conditions as disparate as those 
of diamond synthesis anvils and space probes. Type lib 
diamonds have interesting semiconductive prospects, 
possibly being capable of operating at higher temper- 
atures than silicon devices, but developments in this 
area have been restricted by the difficulty of develop- 
ing n-type devices to complement the /7-type devices 
described above. 

10.4.6.5 Polycrystalline diamond (PCD) 

Poly crystalline aggregates of diamonds rarely occur in 
nature. It is now possible to synthesise a fine-grained 
aggregate by sintering diamond crystallites (2-25 mm 
in size) at a high temperature and pressure in the pres- 
ence of a metal such as cobalt. The resultant blanks 
of PCD are typically about 50 mm diameter and can 
be electric discharge machined (EDM) to the required 
shape and size, thus overcoming the problems associ- 
ated with producing large single crystals. During sin- 
tering, the randomly-oriented grains deform plastically 
and interlock. Molten cobalt acts as a solvent/catalyst 
and also fills the interstices between the grains. This 
structure has a better wear resistance than either natu- 
ral diamond or tungsten carbide. It is also appreciably 
tougher than single crystals of diamond because the 
random orientation of grains forces any propagating 
crack to take an erratic, transgranular path which con- 
sumes strain energy. 

PCD is used to make metal-working dies for draw- 
ing the larger sizes of metallic wire (e.g. >0.2 mm). 
(With smaller sizes of wire, a natural single crystal of 
diamond gives a better surface texture.) When used 
for cutting tools, a thin layer of PCD is supported by a 


thicker layer of tungsten carbide. The latter is brazed 
onto the tool body. The standard quality of PCD is 
subject to a temperature ceiling of 700° C that is set 
by two factors. First, cobalt expands more than the 
diamond phase and higher temperatures cause stress 
cracks to form. Second, above 700° C, cobalt promotes 
the conversion of diamond into graphite. These prob- 
lems were solved by either removing the cobalt phase 
by acid leaching or by using silicon instead of cobalt. 
This variety of PCD is thermally stable at tempera- 
tures up to 1200°C and has been successfully used for 
drilling bits in hard-rock mining. 

10.4.6.6 Baked carbons and graphitized carbons 
Crucibles made from natural graphite bonded with fire- 
clay have been used in metal-melting operations for 
centuries. Although their strength is not exceptional, 
they can withstand temperatures of up to 1200°C. 
Other applications of natural graphite include lubri- 
cants, brake linings, bearings, foundry facings and pen- 
cil ‘leads’. Crystallinity is well developed and they are 
relatively unreactive. 

Most of the great demand for manufactured car- 
bons is met by synthesis. Applications are extremely 
diverse: brushes for electric motors and generators, 
rocket nozzles and nose cones for space vehicles, 
refractory linings for furnaces, moulds for metal- 
founding and hot-pressing, moderators and reflectors 
for nuclear reactors, electrodes for arcwelding, etc. 
There are two main types of bulk carbon products for 
industrial applications. Neither melting nor sintering 
are involved in their manufacture. 

Baked carbons, the first category, are produced by 
extruding/moulding a mix of carbonaceous filler or 
grist (e.g. petroleum coke, anthracite, pitch coke, etc.) 
and binder (e.g. coal-tar pitch, thermosetting resin). 
The shaping method determines the general anisotropy 
of the final product because the coke particles usu- 
ally have one axis longer than the other, consequently 
extrusion orients the particles parallel to the extrusion 
direction whereas moulding orients particles perpen- 
dicular to the compression forces. (Accordingly, the 
final product usually has a so-called ‘grain’.) The 
‘green’ shape is heated (baked) in a reducing atmo- 
sphere at a temperature of 1000-1 200° C. During this 
critical operation, the pitch decomposes or ‘cracks’ and 
provides carbon linkages between the particles of filler. 
Copious amounts of volatiles are evolved, producing 
about 20-30% porosity. Porosity weakens the struc- 
ture, influences electrical conductivity and, because of 
its connected nature, makes the structure permeable to 
gases and liquids. It may be reduced by impregnating 
the baked carbon with more pitch and baking again. 
Further heat treatment at substantially higher temper- 
atures of 25 00 -3 000° C causes the structure to graphi- 
tize. An electric resistance furnace is necessary for 
this purpose, hence the products are sometimes called 
electrographites. (Most designs of electric furnaces for 
larger sizes of product are modifications of the original 
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Acheson design of 1895, a furnace which has a special 
place in the history of the carbon industry.) 

Final shapes, whether baked or graphitized, can be 
readily and accurately machined. The final proper- 
ties of the carbon product depend upon the interplay 
of variables such as initial grain size, type of filler 
or binder, purity, furnace temperature, etc. Thus, the 
higher processing temperatures that are used for elec- 
trographites eliminate strain, heal imperfections and 
promote graphitization within the structure. As a result, 
electrographites have much higher electrical and ther- 
mal conductivities than baked carbons, making it pos- 
sible to halve the cross-sectional area of an electrode 
and gain valuable savings in mass and size. Accord- 
ingly, electrographites are favoured for large steel- 
melting furnaces. However, both baked carbons and 
the more costly electrographites find general appli- 
cation in a wide variety of electric furnaces. Their 
availability and continuous development have made 
a vital contribution to the electric are processes used 
to mass-produce steel, aluminium, carbides of silicon 
and calcium, alkali metals and their hydroxides, phos- 
phorus, magnesium, chlorine, etc. In addition to being 
refractory, electrodes must withstand chemical attack, 
mechanical damage during charging of the furnace, 
severe gradients in temperature and thermal shock. 
In the last respect, carbon electrodes possess a valu- 
able combination of properties; that is, low thermal 
expansivity, low modulus of elasticity, good high- 
temperature strength and high thermal conductivity. 
The low electrical resistivity minimizes resistive heat- 
ing and thus helps to restrict the temperature of the 
electrode. 

The chemical industry uses a variety of impregnation 
techniques to combat the problem of intrinsic porosity 
so that corrosive liquids such as caustic solutions and 
sulphite liquors can be handled. Thermoset resins are 
used to render the carbon impervious to fluids but, being 
organic, tend to decompose if the service temperature 
rises above 170°C. Surfaces can also be sealed with 
more stable compounds such as carbides, silicides, 
borides and nitrides; for example, impregnation with 
molten silicon at a temperature of 2000° C forms silicon 
carbide in situ and makes the surface impervious. 

10.4.6.7 Pyrolytic graphite and vitreous carbon 
Most of the graphitic carbons produced for industry 
are polycrystalline and, as indicated previously, often 
exhibit bulk anisotropy, usually as a result of the shap- 
ing process. Anisotropy is the outstanding character- 
istic of individual graphite crystals; in fact, graphite 
can exhibit more anisotropy in certain properties than 
most other types of crystal. In poly crystalline prod- 
ucts, it has often been necessary to cancel out dan- 
gerous anisotropic effects by deliberately randomizing 
the orientation of the grains (e.g. moderator graphite 
for nuclear reactors). A converse approach has led to 
the successful production of graphite with enhanced 
anisotropy (i.e. pyrolytic graphite (PG)). Compared to 


conventional graphite, PG is less chemically active and 
mechanically stronger. It can also be machined to close 
dimensional tolerances. In the 1960s, for the first time, 
it became possible to produce pieces of near-perfect 
graphite that could be used to explore the physics and 
chemistry of anisotropy in greater detail. 1 

In the basic method of production, pyrolytic graphite 
is deposited by bringing a mixture of hydrocarbon gas 
(methane, CH 4 , or propane, CsHg) and an inert car- 
rier gas into contact with a substrate material which 
is heated to a temperature of approximately 2000° C. 
Molecules of the hydrocarbon are thermally ‘cracked’ 
within the boundary layer of flow at the substrate 
surface, (e.g. CH 4 = C gr + 2H 2 . As indicated previ- 
ously, the physical character of the deposit is deter- 
mined by the process conditions, such as temperature, 
partial pressure of hydrocarbon gas, substrate mate- 
rial and area. These conditions, which are extremely 
critical, decide whether the deposit-forming carbon 
reaches the substrate surface as individual atoms, 
planar fragments or as three-dimensional clusters. 
Thus, planar fragments form laminar deposits which 
are highly anisotropic whereas larger clusters form 
isotropic deposits with little or no overall anisotropy. 
If the substrate takes the form of a resistance-heated 
carbon mandrel, a tubular deposit can be produced. 

Pyrolytic graphite, as-deposited, consists of crystal- 
lites which exhibit some degree of disorder as well as 
stratification. In order to explain the nature of its struc- 
ture, it is helpful to compare it with that of graphite 
in its most stable form. The perfect crystal has a reg- 
ular ABABAB. . . stacking of hexagonal {0002} layers 
in which the distance of closest approach between 
layers is approximately 0.335 nm. This exact registry 
of hexagonal planes, with its ABABAB. . . mode of 
stacking, is not found in pyrolytic graphite. Instead, 
synthesis by chemical vapour deposition (CVD) pro- 
duces a ‘ turbo stratic’ arrangement of layers in which 
the hexagonal networks, although still parallel, do 
not register precisely with each other (Figure 10.15). 
This disregistry has been likened to a pack of play- 
ing cards in which the long and short edges are each 
misaligned. Such structures are both disordered and 
stratified, hence the term ‘turbostratic’. Each crystallite 
comprises a number of hexagonal layers and the inter- 
layer distance is greater than the ideal of 0.335 nm. It 
is useful to assign a c-axis to the stack of layers form- 
ing each crystallite and to determine average values 
for this dimension (L c ) by X-ray diffraction analysis. 
In general terms, a turbostratic structure in graphite 
is envisaged as a conglomerate of crystallites sepa- 
rated by fairly stable transition zones of carbon atoms. 
If annealed at a very high temperature, say 3200°C, 

Professor A. R. J. P. Ubbelohde, FRS (1907-1988) and 
co-workers at Imperial College, London, produced valuable 
prototypes and made many important contributions to 
graphite science (e.g. stress-recrystallization, intercalation 
compounds). 
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(a) 




Figure 10.15 Structures of (a) crystalline graphite and (b) turbo stratic carbon ( from Cahn and Harris, 11 Jan, 1969, 
pp. 132-41; by permission of Macmillan Magazines Ltd). 


a turbostratic structure will ‘heal’ and approach the 
stable ABABAB. . . configuration. 

Various other methods are available for producing 
pyrolytic graphite. Composite structures for aerospace 
applications have been produced by depositing 
pyrolytic graphite on mats of carbon fibre. In the 
field of nuclear reactors, fluidized bed techniques have 
been used to coat fuel particles of uranium dioxide 
with separate PG layers of different anisotropy. In a 
fluidized bed, a rising flow of hydrocarbon gas and 
inert gas levitates the charge of particles; the bed 
temperature is very uniform and close temperature 
control is possible. Fluidized beds operating at a 
relatively low temperature of 1500°C have also been 
used to coat prosthetic devices such as dental implants 
and disc occluders for heart valves with a 1 mm layer 
of carbon. It is necessary to polish these coatings 
before use in order to remove any weakening flaws. 
Their wear resistance and strength have been enhanced 
by including silane in the bed atmosphere. Carbon is 
one of the dominant chemical elements in the human 
body and it is perhaps not surprising that it is a 
valuable prosthetic material, being compatible with 
living tissue and with blood and other body fluids. 

Structural disorder, in its various aspects, is largely 
determined by the conditions of synthesis; thus the 
relative orientation of the crystallites to each other, 
as developed by synthesis, may be parallel (laminar), 
random (isotropic), etc. Both laminar and isotropic 
deposits of pyrolytic graphite are hard and strong 
because L c is small. (Typically, L c values for PG are 
about 3-10 nm.) Although highly anisotropic deposits 
are possible, further enhancement can be achieved 
by ‘stress-recrystallization’. For instance, near-ideal 
graphite with anisotropy ratios of thermal conductivity 


as high as 200 can be produced by encasing a PG 
deposit in an envelope of commercial poly crystalline 
graphite and hot-pressing at temperatures in the order 
of 3000°C. The envelope deforms plastically and the 
compression forces align the crystallites. In laminar 
forms of pyrolytic graphite, the anisotropy is such 
that the material acts as an excellent conductor in 
directions parallel to the hexagonal networks and as 
an equally efficient insulator in the c-direction. Thus, 
the temperature gradient in the direction perpendic- 
ular to the hexagonal networks can be very steep 
(say, 300°C mm -1 ). One practical consequence of this 
gradient occurs during the growth of a PG deposit 
on a heated mandrel, when the temperature at the 
gas/deposit interface gradually decreases, causing the 
structure of the deposited carbon to change as well. 
(This is one reason PG deposits are often very thin: 
the thickest deposits are usually about 30-40 mm.) On 
the other hand, engineering advantage can be taken of 
the anisotropy of laminar PG. For example, pyrolytic 
graphite was advocated by Ubbelohde for small cham- 
bers or crucibles in induction furnaces operating at 
temperatures of 2000 -3 000° C. Chamber walls with 
their internal surfaces parallel to the a -layers favour 
lateral heat transfer and give temperature uniformity 
within the working space, a feature usually sought by 
furnace designers. 1 At the same time, the external wall 
surfaces operate at much lower temperatures; radiation 
losses in accordance with Stefan’s law (T 4 ) are min- 
imized, to the benefit of the thermal efficiency of the 
furnace unit. 

l A tobacco pipe with a PG-lined bowl has been marketed; 
one would expect the high thermal conductance of its 
interior to sustain slow combustion. 
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The glassy, low-porosity material known as vitreous 
carbon is derived from the pyrolysis of a thermosetting 
polymer, rather than from a hydrocarbon gas (e.g. 
phenol formaldehyde resin). In a typical production 
procedure, a specially prepared resin is shaped by 
moulding, carbonized in an inert atmosphere at a 
temperature of 900° C and finally fired in a vacuum 
at a temperature of 1800°C. Very large amounts of 
gas are evolved; because of their disruptive effect, 
the thickness of the product is necessarily restricted 
to about 5-8 mm. Shrinkage figures in the order of 
20-50% have been quoted. These features demand 
close control of firing conditions. 

The structure of vitreous carbon inherits certain 
bonding characteristics from the complex organic pre- 
cursor. Thus graphitization and ordering are inhibited. 
The crystallites are much smaller than those developed 
in pyrolytic graphite. They are only about 5 nm across. 
The crystallites have a larger-than-ideal interlayer dis- 
tance of about 0.35 nm. It appears that cross-links 
persist between the layers and that these bonds have 
a stabilizing and strengthening effect. Furthermore, 
because the crystallites are randomly oriented, vitreous 
carbon is isotropic. Like silica glass, vitreous carbon 
is hard, brittle and fractures in a conchoidal manner. 

Vitreous carbon is valued for its ability to with- 
stand penetration and attack by aggressive liquids such 
as concentrated hydrofluoric acid, caustic alkalis and 
molten metals (e.g. Al, Cu, Pb, Sn. Zn). Its very low 
porosity and high purity contribute to this inertness. 
Generally, it has good resistance to attack by metals 
which do not form carbides. However, like graphite, 
it is attacked by alkali metals. In the chemical labo- 
ratory, vitreous carbon is used for crucibles, furnace 
boats, filters, non-stick burette taps, etc. in chemical 
analysis, crystal growing and zone-refining activities, 
often proving superior to traditional materials such as 
platinum, nickel, glass and quartz. Serving the elec- 
tronics industry, vitreous carbon ware has been used in 
the manufacture of gallium phosphide and arsenide for 
optical/microwave devices, germanium for transistors 
and special glasses for fibre optics. In fact, it was the 
urgent need for an inert container for gallium phos- 
phide which stimulated the development of vitreous 
carbon. In the chemical industry, it is used for rotating 
seals in pumps handling difficult liquids. In addition 
to being harder and stronger than all other forms of 
carbon, with the exception of diamond, it is conduc- 
tive and refractory at temperatures up to 3000°C and 
can be used to make resistance- or induction-heated 
reaction vessels. Ultrasonic machining of vitreous car- 
bon is feasible, producing sound and clean-cut edges. 
(Graphite has a more layered structure and machining 
tends to round or break up edges.) 

10.4.6.8 Chemical reactivity and intercalation of 
graphite 

Study of the manner in which foreign atoms interact 
with and penetrate the layered structure of graphite has 


proved fruitful. As the interlayer distance in graphite is 
quite large, with relatively weak forces acting between 
layers, it is chemically convenient to regard each layer 
as a very large macroaromatic molecule. (This assump- 
tion is reasonable because the covalent C-C bond 
length of 0.1415 nm in the layers is similar to that found 
in the aromatic organic molecule of benzene, C 6 H 6 .) 
Within individual layers of most forms of graphite, there 
are usually clusters of vacant sites which form ‘holes’. 
Although chemical reactions with foreign atoms which 
penetrate between layers can take place at these ‘holes’, 
they are more likely to occur at the layer edges where 
free valency bonds project. For instance, attack by oxy- 
gen is favoured when the edges of layers impinge on 
free surfaces, voids or cracks. If reaction with for- 
eign atoms does occur within the layers, newly-formed 
bonds can cause the layers to buckle. 

In contrast to these rather random forms of 
attack and reaction, it is possible to interpose 
(intercalate) substantial numbers of foreign atoms or 
molecules between the parallel macromolecules to 
form stoichiometric graphite compounds, as indicated 
in Figure 10.16. (Ubbelohde coined the term ‘synthetic 
metals’ for this new class of material.) These 
intercalation compounds are produced by exposing 
graphite to a vapour phase or chemical solution or by 
electrolysis. By using potassium as the intercalate, a 
family of potassium graphites has been produced (e.g. 
C 60 K, C4gK, C36K, C24K, CgK). This regular sequence 



Figure 10.16 Potassium graphite, C$K. Hexagon layers of 
graphite separate and slip into the unstaggered sequence to 
permit intercalation of potassium atoms (from Ubbelohde, 
1964; by permission ofBCURA Ltd., Coal Res. Establ., 
Stoke Orchard, Cheltenham ). 
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of stoichiometric ratios shows that as intercalation 
proceeds, the distance between the potassium-filled 
regions decreases stepwise in a systematic manner. 
The intercalates force the macromolecules apart and 
sometimes cause them to shear past each other, 
changing the ABABAB . . . stacking sequence to 
AAAA . . . Intercalation species include alkali metals, 
halogens, caesium and rubidium, nitrate, bisulphate 
and phosphate radicals. 

Intercalation produces interesting electrical effects 
involving two forms of charge transfer. Graphite itself 
has moderate electrical conductivity at low tempera- 
tures with about one charge carrier per 10 4 atoms of 
carbon. (In copper, the ratio is 1:1.) Intercalation with 
potassium to form potassium graphite (C 8 K) raises the 
number of charge carriers to one in ten (— ve), with the 
electron-donating alkali metal injecting electrons into 
the ‘empty’ band of energy levels in graphite. Con- 
versely, when bromine intercalates to form the com- 
pound C 8 Br, the electrical conductivity again increases 
significantly as a result of the halogen atoms acting 
as electron acceptors and removing electrons from the 
‘full’ electron band of graphite. In this case, the num- 
ber of carriers rises to about one in ten (+ve). The 
electron transfer associated with intercalation can pro- 
duce striking colour changes: the potassium graphites 
C24K (intermediate) and C 8 K (saturated) are blue and 
golden bronze, respectively. These unique processes 
of charge transfer have provided valuable insights into 
the chemistry of bonding. 

As a general rule, near-ideal graphite intercalates 
more readily than less perfect forms; that is, the inter- 
calate can penetrate more easily between the macro- 
molecules without hindrance from interlayer bonds. 
Certain substances have the ability to ‘unpin’ the 
layers, acting as intercalation catalysts (e.g. iodine 
monochloride). These catalysts have been used in 
large-scale chemical processing. Intercalation with 
halogens (bromine or chlorine) has been used to 
strengthen the vital bond between filaments and the 
resin matrix in carbon fibre ‘reinforced’ polymers. 
Intercalation forms the basis of rapid proving tests 
that have been used by the nuclear industry to sim- 
ulate the action of neutron bombardment upon pro- 
totype graphites. The intercalating bromine atoms 
force the layers of graphite to open up in the c- 
direction. 

10.4.6.9 From buckminsterfullerene to fullerite 

Circa 1985, examination of vaporized carbon led to the 
exciting discovery of a new and unexpected molecular 
form. 1 Mass spectrometry revealed that stable clus- 
ters of carbon atoms, relative molecular mass 720, 
were present in significant quantity. It was postu- 
lated that these molecules were spherical and cage- 
shaped, with 60 carbon atoms forming the vertices 

discovered by Professor Harold W. Kroto and colleagues 
at Sussex University. 



Figure 10.17 Spherical molecule of buckminsterfullerene 
( courtesy of Professor H. W. Kroto, University of Sussex, 
UK). 


of a truncated icosahedron to give a C60 shell struc- 
ture that is reminiscent of geodesic buildings and 
of a soccer ball (Figure 10.17). The C60 molecule 
was named buckminsterfullerene. 2 Larger fullerene 
molecules have since been produced which have more 
than 200 carbon atoms per shell. 

The discovery of this third form of carbon, with 
symmetrical characteristics completely different to 
those of diamond and graphite, prompted attempts 
to evolve synthetic routes capable of producing solid 
buckminsterfullerene in macroscopical quantities large 
enough for more detailed scientific study and for com- 
mercial exploitation. Powder and thin-film samples of 
a solid crystalline form, fullerite, were subsequently 
produced in gram quantities. 3 In the basic method, a 
sooty mixture of graphitic carbon, C 60 and C 70 is pro- 
duced by striking an arc between carbon electrodes 
in an atmosphere of helium. The other fullerene, C70, 
which forms in smaller quantities than C 6 o, appar- 
ently has elongated molecules. The soot contains about 
5-10% of the two fullerenes. After extraction with 
benzene, micrometre- sized crystals of C60 molecules 
form. Some disordering of stacked hexagonal arrays of 

2 Named after the American architect R. Buckminster Fuller 
(1895-1983) who pioneered the geodesic building, a 
revolutionary design form which encloses the maximum of 
space with the minimum of materials. Many spherical 
viruses also have icosahedral symmetry. 

3 Solid C 60 was produced by collaboration between 
D. R. Huffman and W. Kratschmer, and their co-workers, 
at the University of Arizona, USA, and the Max-Planck- 
Institut fiir Kernphysik, Heidelberg, Germany, respectively. 
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C^o molecules is evident. STEM micrographs of mono- 
layers have been produced which clearly show hexag- 
onally close-packed arrays of the spherical molecules 
(i.e. six-fold symmetry) 

In parallel with its impact on organic and poly- 
mer chemistry, C 6 o stimulated many proposals in 
the materials field. Proposed ‘buckyball’ products 
have included lubricants, semiconductors and fila- 
mentary reinforcement for composites. It has been 
suggested that the cage-like structure of individual 
molecules makes them suitable for encapsulating metal 
or radioisotope atoms, a form of molecular- sc ale pack- 
aging (e.g. La 3+ ). There is also interest in bonding ions 
to their exterior surfaces. 


10.5 Aspects of glass technology 

10.5.1 Viscous deformation of glass 

Viscosity 4 is a prime property of glass. Although the 
following account deals mainly with the working and 
annealing of glass, it is broadly relevant to glazes and 
to the glassy intergranular phase found in many ceram- 
ics. Two mathematical expressions help to describe the 
nature of viscous flow in glasses. First, there is the 
formula for Newtonian flow in an ideal fluid: 

8y/8t = r • 0 = r • I /37 

where 8y/8t is the shear strain rate, r is the applied 
shear stress, 0 is the fluidity coefficient and 37 is 
the viscosity coefficient. In its melting range, a 
typical Si0 2 -Na 2 0-Ca0 glass has a viscosity of 
5-50 N s m -2 . (The viscosity of liquid metals is 
roughly 1 mN s m -2 .) As glass is hot-drawn, its cross- 
sectional area decreases at a rate which depends solely 
upon the drawing force and viscosity, not upon area. 
For this reason, the glass extends uniformly and does 
not ‘neck’. 

The second expression, which is of the Arrhenius- 
type, concerns the temperature-dependency of visco- 
sity, as follows: 

r] = A exp (Q/kT) 

where A is a constant, Q is the activation energy, 
k is the Boltzmann constant and T is the absolute 
temperature. The activation energy for viscous flow is 
accordingly obtained by plotting log 37 against 1 /T and 

4 Dynamic (absolute) viscosity is the force required to 

move 1 m 2 * of plane surface at a velocity of 1 m s _1 to a 

second plane surface which is parallel to the first and 

separated 1 m from it by a layer of the fluid phase. 

Kinematic viscosity = absolute viscosity/density. Unit of 

viscosity = 1 N s m -2 = 10 P (poise). Fluidity is the 

reciprocal of viscosity. 


measuring the slope of the line. Although the above 
two expressions are valid for many types of glass, 
for certain glasses there are conditions of temperature 
and stress which result in non-Newtonian and/or non- 
Arrhenian behaviour. 

As indicated in Section 2.6, the viscosity of oxide 
glasses depends upon composition and temperature. 
Generally, an increase in the concentration of modi- 
fying cations and/or a rise in temperature will cause 
the viscosity to fall. Figure 10.18 shows the rela- 
tion between viscosity and temperature for a typical 
Si0 2 -Na 2 0-Ca0 glass. Two key values of viscosity 
are marked; they correspond to two practical temper- 
atures which are known as (1) the Fittleton softening 
point (viscosity = 10 6 6 N s m -2 ) and (2) the annealing 
point (viscosity = 10 124 N s m -2 ). 

The softening point is determined by a standard- 
ized procedure and gives the maximum temperature at 
which the glass can be handled without serious change 
in dimensions. For ordinary silica glass, it is about 
1000 K. At the annealing temperature, the ions are suf- 
ficiently mobile to allow residual stresses to be relieved 
in about 15 min. The point in the curve at which the 
slope is a maximum corresponds to the inflection (Ac- 
tive or glass transition temperature) in the specific vol- 
ume/temperature curve for the particular glass-forming 
system. The working range for commercial silica glass 
corresponds to a viscosity range of 10 3 — 10 7 * * * N s m -2 . 
The curve for this glass is quite steep, indicating that 
temperature control to within, say, ±10°C is necessary 
during working (i.e. drawing, blowing, rolling, etc.). 
Figure 10.19 provides a comparison of the viscosity 
curves for different types of glass. The difficulties of 
working pure silica glass (‘fused quartz’) are immedi- 
ately apparent. Even at a temperature of 1300°C this 
glass has a viscosity of about 10 12 N s m~ 2 , which is 
still too high for working. Glasses with special chemi- 
cal and physical properties often have a steep viscosity 
curve and tend to devitrify, presenting difficulties dur- 
ing drawing at traditional working temperatures. One 



Figure 10.18 Viscosity versus temperature curve for a 
typical Si02 -Na 2 O-CaO glass. 
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Figure 10.19 Viscosity curves for typical glasses. 


solution has been to hot-extrude such glasses through 
graphite dies into rod or tube form. 1 By using very 
high pressures (up to 10 GN m -2 ), it is possible to 
overcome a high viscosity of 10 7 N s m -2 and to use 
a lower extrusion temperature (e.g. 950° C). Such tem- 
peratures, which can be as much as 200° C lower than 
the temperature required for traditional methods, pre- 
vent devitrification and, by suppressing surface tension 
effects, help to form rods and tubes with sharp and 
accurate profiles. 

10.5.2 Some special glasses 

Most glasses produced are based upon silica and fluxed 
with alkali metal oxides, with their chemical formu- 
lation adjusted to suit the application (e.g. bottles, 
plate glass, lamps, etc.). In its purest form, vitreous 
silica has a very low linear coefficient of thermal 
expansion {a = 0.55 x 10 _6o C _1 ) and is used in high- 
temperature applications. ‘Fused quartz’ is made by 
melting sand; high-purity ‘fused silica’ is made by 
vapour deposition after reacting SiCl 4 with oxygen at 
temperatures above 1500°C. (These two definitions are 
tentative.) 

Window (‘soda-lime’) glass, typically 72Si0 2 - 
14Na 2 O-10CaO-2MgO-lAl 2 O 3 , owes its conve- 
nient range of working temperature (700- 1 000° C) to 
the two network-modifiers, Na 2 0 and CaO. The effec- 
tiveness of their fluxing action can be gauged by the 
fact that the softening point of this glass (700° C) is 
at least 800° C lower than that of high-purity silica 
glass. The softening point, as defined in the ASTM 
test method, is the temperature at which a glass fibre of 
uniform diameter elongates at a specific rate under its 
own weight. The chemical formulation given uses the 
‘system’ nomenclature recommended by F. V. Tooley 
(i.e. oxide components are stated in order of decreasing 
% w/w.) 

1 Philips Research Laboratories, Aachen, Germany. 


Pyrex glass, 81Si0 2 -13B 2 03-4Na 2 0-2Al 2 03, 
combines two network-formers and is well known for 
its chemical durability. Because of its relatively low 
expansivity ( a = 3.3 x 10 _6o C _1 ; 0-300°C), it is tol- 
erant to thermal shock and is widely used for chemi- 
cal glassware in laboratories and industrial processing 
plant. Sometimes it is referred to as a ‘borosilicate’ 
glass, unfortunately implying crystallinity. 

In certain glasses, being an intermediate, PbO can 
actually form a network. This oxide is a flux and, 
unlike Na 2 0, does not reduce the electrical resistiv- 
ity; glasses in the Si 0 2 -Pb 0 -Na 2 0 -Al 2 03 system are 
widely used for electronic and electrical components. 
Their high lead content favours absorption of X-rays 
and y-rays, making them suitable for radio-logical 
shielding (65% PbO) and colour television tubes (23% 
PbO). Lead also increases the refractive index and dis- 
persion of Si0 2 -based glasses. These effects are used 
to advantage in optical lenses and in the decorative 
glass known confusingly as ‘lead crystal’ . In the latter 
case, 20-30% PbO provides craftsmen with a ‘long’ 
glass, that is, one with a large temperature interval for 
working/manipulation, within which viscosity changes 
very slightly. 

Alumina (A1 2 0 3 ), an intermediate, frequently 
features in the composition of glasses. In small 
amounts, say 1-2% w/w, it improves the resistance 
of Si0 2 -based glasses to attack by water. However, its 
presence increases melt viscosity and makes melting 
and fining more difficult; for instance, gas bubbles 
separate less readily. The dependence of viscosity upon 
composition is always an important matter in glass 
technology. Al 3+ cations enter the tetrahedral holes in 
the network, replacing tetravalent silicon cations. As 
a result, the number of non-bridging O 2- anions is 
reduced and the strength and rigidity of the network 
is enhanced. By substantially raising the alumina 
content to 15-25% w/w, it is possible to develop 
valuable stability at high temperatures. For exam- 
ple, 62Si0 2 -17Al 2 0 3 -8Ca0-7Mg0-5B 2 0 3 -lNa 2 0 
glass has a softening point in excess of 900° C, which 
is about 100°C higher than that of Pyrex glass. This 
‘aluminosilicate’ glass is used for combustion tubes. 

10.5.3 Toughened and laminated glasses 

Glasses, in general, are weak when placed in ten- 
sion during bending and do not withstand impact 
blows. However, in compression, glasses can exhibit 
exceptional strength. Accordingly, sheet glass is often 
strengthened and toughened by developing high resid- 
ual compressive stresses in the surface layers. They 
oppose any applied tensile stresses (say, on the convex 
face of a bent sheet of glass), and must be overcome 
before this vulnerable surface begins to develop dan- 
gerous tensile stresses. Various physical and chemical 
processes, applicable on a mass-production scale, are 
available to improve the load-bearing capacity and 
behaviour under impact conditions. 
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(a) Thermally toughened glass plate 



(b) Chemically toughened glass plate 


Figure 10.20 Distribution of residual stress in toughened 
glass plate (stress values in MN m~ 2 ). 

The term ‘toughening’ generally refers to both heat- 
treatment and chemical methods. In thermal temper- 
ing, a toughening process used extensively for side 
and rear glazing in cars, glass sheets are cut and 
shaped, heated uniformly to a temperature above T g 
(say, 700°C), soaked for 2-4 min and then quenched 
with jets of air. Glass has a low thermal conductiv- 
ity and the eventual contraction of the core regions is 
restrained by the rigid outer layers, producing the bal- 
ance of compressive (— ve) and tensile (+ve) stresses 
shown in Figure 10.20a. The glass sheet must be free 
from flaws in order to withstand the severe quench 
stresses and must be at least 2-3 mm thick so that the 
desired stress gradients develop. Surface stresses up 
to 200 MN m -2 are produced by this method. Heat- 
ing toughened glass to a temperature of 200° C will 
relieve these stresses. Small blunt-edged fragments are 
produced when toughened glass fractures underimpact. 
Toughened products cannot be cut or shaped; notching, 
drilling, etc. must be done before treatment. 

Chemical strengthening/toughening is also used to 
develop protective compressive stresses. These treat- 
ments are more versatile, being suitable for thin and 
thick sections of glass and for unusual shapes (e.g. 
Chemcor.) However, because ion exchange and diffu- 
sion are involved, they require longer process times. 
Their purpose is to alter the surface composition and 
reduce its coefficient of thermal expansion. On cool- 
ing from a high temperature, the greater contraction 
of the hot core is restrained and very high compres- 
sive stresses develop in the modified surface layers 
(Figure 10.20b). In one version of chemical strength- 
ening, sodium-containing glass is heated in an atmo- 
sphere of water vapour and sulphur dioxide and an 
exchange between H + ions and Na + ions takes place 
in the glass surface (i.e. ‘H-for-Na’). In another high- 
temperature version, a ‘Li-for-Na’ exchange occurs 
when the glass is immersed in a melt of lithium salt. 
In low-temperature chemical strengthening, large ions 


from molten salt displace smaller ions from the sur- 
faces of immersed glass, causing them to expand (e.g. 
‘K-for-Na’, ‘Na-for-Li’). On cooling from a relatively 
low temperature, the core is rigid and the presence 
of absorbed, larger ions in the surface layers develops 
compressive stresses. Chemical strengthening methods 
often use glasses of special composition. 

The bottle-manufacturing industry uses chemical 
surface treatments to strengthen returnable bottles 
(improving trippage) and to provide a useful lubricity 
as they pass through bottle-filling plant at high speed. 
(Rubbing two glass bottles together reveals whether a 
lubricating coating is present.) 

The alternative to the above toughening methods, 
based upon a composite approach, is to interleave two 
matched sheets of ‘float’ glass with an interlayer of 
plastic, nearly 1 mm thick. Polyvinyl butyrate (PVB) 
is commonly used for the interlayer. In the mid-1960s, 
this more costly method gained favour for car wind- 
screens as a result of accident statistics which indi- 
cated that a laminated windscreen was less likely to 
cause fatal injury than a toughened one. 1 Cracking 
is confined to the area of impact. The PVB inter- 
layer deforms elastically, absorbs impact energy and 
holds glass fragments in place, reducing the risk of 
facial lacerations. Visibility is retained after impact. 
De-icing windscreens have fine heating filaments of 
tungsten embedded between the glass layers (Triplex 
Hotscreen). Laminated glass with a thicker plastic 
interlayer is used for aircraft windscreens; these must 
withstand extreme thermal conditions, ranging from 
high altitudes (— 80°C) to kinetic heating (150°C), 
and the possible high-speed ‘strikes’ of birds weighing 
about 2 kg. 

The manufacture of laminated glass demands clean- 
liness so that the PVB will bond strongly to the glass. 
Curved windscreens are made by separately bending 
matched pairs of glass sheet at a temperature just above 
600° C. After cooling, PVB is applied to one sheet of 
each pair in an air-conditioned room and the matching 
sheet added. Air can be removed from the PVB/glass 
interfaces of the curved assembly by applying a vac- 
uum to its edges. The glazing is finally autoclaved at 
a pressure of 1.2- 1.4 MN nrT 2 and a temperature of 
135-145°C in order to complete the bonding. 

The oil crisis of the mid-1970s underlined the 
need for weight saving and fuel efficiency in car 
design. As a direct consequence, the thickness of lam- 
inated windscreens decreased from 3.0 mm/3.0 mm 
to 2.1 mm/2.1 mm. (Toughened glass decreased in 
thickness from 5-6 mm to 3-4 mm.) Advances in 
glass technology have made it possible for modem 
car designs to increase the area of glazing, use flush, 

1 Strong sunlight striking a quench-toughened window glass 
is polarized by reflection; the stress-birefringent patterns 
produced by the quench jets are often apparent, particularly 
when viewed through Polaroid sunglasses. A pastime for 
traffic jams is to note the choice made by car manufacturers 
between quench-toughened and laminated glass. 
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polyurethane adhesive-mounted windows (of low drag 
coefficient) and achieve rounded, aerodynamic pro- 
files. 1 

10.6 The time-dependency of strength 
in ceramics and glasses 

Time and strain-rate have a special relevance to 
the strength of ceramics. For example, the measured 
strength of individuals taken from a set of ceramic 
specimens can show an appreciable decrease over a 
period of time. With regard to the mechanism of such 
time-dependent failure, it is first necessary to distin- 
guish between fast and slow crack growth. 

Fast crack growth, as produced in a routine bend 
test, takes place when the crack reaches a critical 
size. Slow or delayed crack growth takes place in 
a stressed ceramic over a relatively long period of 
time and involves slow propagation of a crack of 
sub-critical dimensions. Fast fracture occurs when this 
crack reaches criticality. This phenomenon of progres- 
sive weakening and degradation, which is termed static 
fatigue or delayed fracture, has long been known in 
glasses. It is now recognized in a wide variety of 
ceramics (e.g. alumina, slicon nitride, vitreous car- 
bon). Its occurrence in poly crystalline oxide ceramics 
is often related to the presence of a glassy bonding 
phase. The stresses capable of causing static fatigue 
can be continuous or fluctuating: even at quite low 
stress levels, there is a finite probability of failure by 
this insidious process. Sometimes chemical agencies 
have an overriding effect. For example, when mois- 
ture is able to diffuse to crack tips in silica glasses and 
alumina, the velocity of crack growth increases. This 
type of behaviour is understandably regarded as stress- 
corrosion. Slow crack growth is a thermally-activated 
process and an increase in the ambient temperature can 
have a very significant effect. 

The nature of delayed crack growth is customarily 
expressed by plotting crack velocity (V) against the 
stress intensity factor (Ki), as shown in Figure 10.21. 
This plot is idealised but is typical of ceramics which 
are capable of slow crack growth at ambient tempera- 
ture, such as glasses and oxide ceramics. Three regions 
of crack growth lie between the ‘static fatigue limit’ 
( Ki 0 ), below which crack growth is not of concern, and 
the critical value of the stress-intensity factor (Ki c ) at 
which sudden fracture occurs. In region I, the veloc- 
ity is an exponential function of the stress-intensity 
factor and can be expressed in the form V = A(Ki) n . 
In region II, velocity is independent of K\. Regions 
I and II are environment- sensitive and correspond to 
conditions of stress-corrosion. In region III, velocity is 

^he Jaguar XJ220 sports car has compound-curvature 
front, top and rear screens which blend smoothly with the 
body contours. 



Figure 10.21 Idealized plot of relation between crack 
velocity and stress-intensification. 

again dependent upon K\ but is independent of envi- 
ronment. 

Time-to-failure depends primarily upon the profile 
of region I. Its slope (n), sometimes known as the 
stress-corrosion constant of the particular material, 
ranges in value from about 10 to 100. Low values, such 
as those for glasses (10-40), indicate a susceptibility to 
slow crack growth. Figure 10.22 shows typical curves 
for a variety of ceramic materials. Changes in envi- 
ronment (temperature, pH, etc.) can displace a curve 
and/or alter its slope. For instance, at elevated temper- 
atures, the slopes of curves for carbides and nitrides 
become less steep. 

Study of the statistical nature and time-dependency 
of strength in ceramics led to the development of 
strength-probability-time (SPT) diagrams 2 of the type 
shown in Figure 10.23. These diagrams refer to spe- 
cific conditions of testing and are plotted on Weibull 
probability paper. As the n -value (for region I crack 
growth) decreases, indicating increased sensitivity to 
environment, the sloping lines close up. Such dia- 
grams clearly show the interplay between stress, the 
probability of survival and the lifetime before 
fracture. 

Recognition of the fact that there is apparently no 
absolutely safe stress for a ceramic which is sub- 
ject to long-term service stimulated interest in proof- 
testing. By eliminating weaker members, such tests 
provide survivors with a better (narrower) distribution 
of strength than that of the original population. Statis- 
tical predictions of service life benefit accordingly. It 
must be borne in mind that the proportion of survivors 
reflects the conditions of testing. Obviously, the period 
of testing is important; the longer the time, the smaller 
the number of survivors. Ideally, proof-tests should be 
carried out under conditions that do not cause sub- 
critical crack growth. 

2 Devised by R. W. Davidge and co-workers at the Atomic 
Energy Research Establishment, Harwell. 
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Figure 10.22 Relation between crack velocity and stress intensity factor for various ceramics in neutral water at 25° C (after 
Creyke et al., 1982; by courtesy of Elsevier/Chapman and Hall). 



Figure 10.23 Strength-probability -time diagram for alumina derived from bend tests at 293 K: the lifetime r of 10 5 s is 
regarded as a tentative stress-corrosion limit (after Davidge, 1986, p. 145; by courtesy of Cambridge University Press). 
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Chapter 11 


Plastics and composites 


11.1 Utilization of polymeric 
materials 

11.1.1 Introduction 

In Chapter 2 the basic chemistry and structural 
arrangements of long-chain molecules were described 
and it was shown how polymers can be broadly 
classified as thermoplastics, elastomers and thermosets. 
Certain aspects of their practical utilization will now 
be examined, with special attention to processing; 
this stage plays a decisive role in deciding if a 
particular polymeric material can be produced as a 
marketable commodity. The final section (11.3) will 
concern composites, extending from the well-known 
glass-reinforced polymers to those based upon ceramic 
and metallic matrices. 


various molecular relaxation processes associated with 
the glass transition. In linear viscoelastic behaviour, 
total strain comprises a linear elastic (Hookean) com- 
ponent and a linear viscous (Newtonian) component. 
The stress -strain ratios depend upon time alone. In the 
more complex non-linear case, which usually applies 
to polymers, strain is a function of time and stress 
because molecular movements are involved. 

The phenomenon of stress relaxation can be used 
to chart the way in which the behaviour of a given 
polymer changes from glassy to rubbery. Figure 11.1 
shows the non-linear response of a polymer that is 
subjected to constant strain £ 0 - Stress a relaxes with 
time t. The relaxation modulus E r at time t is given 
by the expression: 

E r = cr t /s 0 (11.1) 


11.1.2 Mechanical aspects of T g 

As indicated in Chapter 2, it is customary to quote a 
glass-transition temperature T g for a polymer because 
it separates two very different regimes of mechanical 
behaviour. (The value of T g is nominal, being subject 
to the physical method and procedure used in its deter- 
mination). Below T g , the mass of entangled molecules 
is rigid. Above T g , viscoelastic effects come into play 
and it is therefore the lower temperature limit for pro- 
cessing thermoplastics. The structural effect of raising 
the temperature of a glassy polymer is to provide an 
input of thermal energy and to increase the vibrations 
of constituent atoms and molecules. Molecular mobilty 
increases significantly as T g is approached: rotation 
about C-C bonds in the chain molecules begins, the 
free volume of the structure increases and intermolecu- 
lar forces weaken. It becomes easier for applied forces 
to deform the structure and elastic moduli to fall. 

The mechanical properties of polymers are highly 
dependent upon time and temperature, the response 
to stress being partly viscous and partly elastic. For 
instance, ‘natural’ time periods are associated with the 


Thus E T , which is represented by the slope of 
dotted join lines, decreases with time. This variation 
is shown more precisely by a plot of log E r versus 
log time t , as in Figure 11.2a. The thermoplastic 



Figure 11.1 Stress relaxation at constant strain. 
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(a) (b) 




Figure 11.2 Time -temperature dependence of elastic modulus in thermoplastic polymeric solid: (a) change in relaxation 
modulus E r (t ) as function of time; (b) change in tensile modulus as function of temperature ( from Hertzberg, 1989; by 
permission of John Wiley and Sons). 


polymer changes in character from a glassy solid, 
where the relaxation modulus is a maximum, to a 
rubbery solid. In the complementary Figure 11.2b, 
data from standard tensile tests on the same polymer 
at different temperatures are used to provide values 
of elastic moduli ( E ). The similarity of profiles in 
Figures 11.2a and 11.2b illustrates the equivalence of 
time and temperature. (Theoretically, the modulus for 
a short time and a high temperature may be taken to 
equate to that for a combination of a long time and a 
low temperature; this concept is used in the preparation 
of relaxation modulus versus time graphs.) The glass 
transition temperature T g has been superimposed upon 
Figures 11.2a and 11.2b. 

Although single values of T g are usually quoted, 
the process of molecular rearrangement is complex 
and minor transitions are sometimes detectable. Thus, 
for PVC, the main glass transition occurs at tem- 
peratures above 80° C but there is a minor transition 
at — 40°C. Consequently, at room temperature, PVC 
exhibits some rigidity yet can elongate slightly before 
fracture. Addition of a plasticizer liquid, which has a 
very low T g , lowers the T g value of a polymer. Sim- 
ilarly, T g for a copolymer lies between the T g values 
of the original monomers; its value will depend upon 
monomer proportions. 

In elastomeric structures, as the temperature is 
increased, the relatively few crosslinks begin to vibrate 
vigorously at T g and the elastomer becomes increas- 
ingly rubbery. As one would anticipate, T g values 
for rubbers lie well below room temperature. Increas- 
ing the degree of crosslinking in a given polymer 
has the effect of raising the entire level of the lower 
‘rubbery’ plateau of the modulus versus temperature 
plot upwards as the polymer becomes more glassy 
in nature. The thermosets PMMA (Perspex, Lucite ) 
and PS have T g values of 105°C and 81°C, respec- 
tively, and are accordingly hard and brittle at room 
temperature. 

11.1.3 The role of additives 

Industrially, the term ‘plastic’ is applied to a polymer 
to which one or more property-modifying agents have 


been added. Numerous types of additive are used by 
manufacturers and fabricators; in fact, virgin polymers 
are rarely used. An additive has a specific function. 
Typical functions are to provide (1) protection from 
the service environment (anti-oxidants, anti-ozonants, 
anti- static agents, flame-retardants, ultraviolet radia- 
tion absorbers), (2) identification (dyes, pigments), 

(3) easier processability (plasticizers), (4) toughness, 
and (5) filler. 

In many instances the required amount of additive 
ranges from 0.1% to a few per cent. Although ultra- 
violet (UV) components of sunlight can structurally 
alter and degrade polymers, the effect is particularly 
marked in electric light fittings (e.g. yellowing). Stabi- 
lizing additives are advisable as some artificial light 
sources emit considerable amounts of UV radiation 
with wavelengths in the range 280-400 nm. For any 
polymer, there is a critical wavelength which will have 
the most damaging effect. For instance, a wavelength 
of 318.5 nm will degrade PS, which is a common 
choice of material for diffusers and refractors, by either 
causing cross-linking or by producing free radicals that 
react with oxygen. 

The action of a plasticizer (3) is to weaken inter- 
molecular bonding by increasing the separation of the 
chain molecules. The plasticizer may take the form of 
a liquid phase that is added after polymerization and 
before processing. Additions of a particulate toughener 

(4) such as rubber may approach 50% and the material 
is then normally regarded as a composite. 

A wide variety of fillers (5) is used for polymers. In 
the case of thermosets, substances such as mica, glass 
fibre and fine sawdust are used to improve engineering 
properties and to reduce the cost of moulded prod- 
ucts. PTFE has been used as a filler (15%) to improve 
the wear resistance of nylon components. Although 
usually electrically non-conductive, polymers can be 
made conductive by loading them with an appropriate 
filler (e.g. electromagnetic shielding, specimen mounts 
in SEM analysis). Fillers and other additives play an 
important role in the production of vulcanized rubbers. 
Inert fillers facilitate handling of the material before 
vulcanization (e.g. clay, barium sulphate). Reinforcing 
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Figure 11.3 (a) World consumption of plastics, (b) plastics consumption by market sector, Western Europe, and (c) destination 
of post-consumer plastic waste, Western Europe (courtesy of Shell Briefing Services, London ). 


fillers restrict the movement of segments between the 
branching points shown in Figure 2.24. For instance, 
carbon black has long been used as a filler for car tyres, 
giving substantial improvements in shear modulus, tear 
strength, hardness and resistance to abrasion by road 
surfaces. Tyres are subject to fluctuating stresses dur- 
ing their working life and it has been found useful 
to express their stress -strain behaviour in terms of 
a dynamic shear modulus (determined under cyclic 
stressing conditions at a specified frequency). Time is 
required for molecular rearrangements to take place 
in an elastomer; for this reason the dynamic modu- 
lus increases with the frequency used in the modulus 
test. The dynamic modulus at a given frequency is 
significantly enhanced by the crosslinking action of 
vulcanization and by the presence of carbon black. 
These carbon particles are extremely small, typically 
20-50 nm diameter. Small amounts of anti-oxidants 1 
and anti-ozonants are often beneficial. The principal 
agents of degradation under service conditions are 
extreme temperatures, oxygen and ozone, various liq- 
uids. In the last case a particular liquid may penetrate 
between the chains and cause swelling. 

11.1.4 Some applications of important 
plastics 

Figures 11.3a and 11.3b summarize 1990 data on 
world and West European consumption of plastics. The 
survey included high-volume, low-price commodity 
plastics as well as engineering and advanced plastics. 
Thermoplastics dominate the market (i.e. PE, PVC and 
PP). Development of an entirely new type of plastic is 
extremely expensive and research in this direction is 

'in the 1930s, concern with oxidation led the Continental 
Rubber Works, Hannover, to experiment with 
nitrogen-inflation of tyres intended for use on Mercedes 
‘Silver Arrows’, Grand Prix racing cars capable of 300 
km/h. This practice was not adopted by Mercedes-Benz for 
track events. 


limited. Research is mainly concerned with improv- 
ing and reducing the cost of established materials 
(e.g. improved polymerization catalysts, composites, 
thermoplastic rubbers, waste recycling). 

The low- and high-density forms of polyethylene, 
LDPE and HDPE, were developed in the 1940s and 
1950s, respectively. Extruded LDPE is widely used as 
thin films and coatings (e.g. packaging). HDPE is used 
for blow-moulded containers, injection-moulded crates 
and extruded pipes. An intermediate form of adjustable 
density known as linear low-density polyethylene, 
LLDPE, became available in the 1980s. Although 
more difficult to process than LDPE and HDPE, film- 
extruded LLDPE is now used widely in agriculture, 
horticulture and the construction industry (e.g. heavy- 
duty sacks, silage sheets, tunnel houses, cloches, damp- 
proof membranes, reservoir linings). Its tear strength 
and toughness have enabled the gauge of PE film to be 
reduced. A further variant of PE is ultra-high molecu- 
lar weight polyethylene, UHMPE, which is virtually 
devoid of residual traces of catalyst from polymer- 
ization. Because of its high molecular mass, it needs 
to processed by sintering. UHMPE provides the wear 
resistance and toughness required in artificial joints of 
surgical prostheses. 

Polyvinyl chloride (PVC) is the dominant plastic in 
the building and construction industries and has effec- 
tively replaced many traditional materials such as steel, 
cast iron, copper, lead and ceramics. For example, 
the unplasticized version (UPVC) is used for win- 
dow frames and external cladding panels because of 
its stiffness, hardness, low thermal conductivity and 
weather resistance. PVC is the standard material for 
piping in underground distribution systems for potable 
water (blue) and natural gas (yellow), being corrosion- 
resistant and offering small resistance to fluid flow. 
Although sizes of PVC pipes tend to be restricted, PVC 
linings are used to protect the bore of large-diameter 
pipes (e.g. concrete). The relatively low softening tem- 
perature of PVC has stimulated interest in alternative 
piping materials for underfloor heating systems. Poly- 
butylene (PB) has been used for this application; being 
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supported, it can operate continuously at a temperature 
of 80° C and can tolerate occasional excursions to 
11 CPC. However, hot water with a high chlorine con- 
tent can cause failure. Other important building plastics 
are PP, ABS and polycarbonates. Transparent roofing 
sheets of twin- walled polycarbonate or PVC provide 
thermal insulation and diffuse illumination: both mate- 
rials need to be stabilized with additives to prevent UV 
degradation. 

The thermoplastic polypropylene, PP ( Propathene ) 
became available in the early 1960s. Its stiffness, tough- 
ness at low temperatures and resistance to chemicals, 
heat and creep (T m = 165-170°C) are exceptional. PP 
has been of particular interest to car designers in their 
quest for weight-saving and fuel efficiency. In a typi- 
cal modem saloon car, at least 10% of the total weight 
is plastic (i.e. approximately 100 kg). PP, the lowest- 
density thermoplastic (approximately 900 kg m~ 3 ), is 
increasingly used for interior and exterior automotive 
components (e.g. heating and ventilation ducts, radiator 
fans, body panels, bumpers (fenders)). It is amenable 
to filament-reinforcement, electroplating, blending as a 
copolymer (with ethylene or nylon), and can be pro- 
duced as mouldings (injection- or blow-), films and 
filaments. 

Polystyrene (PS) is intrinsically brittle. Engineering 
polymers such as PS, PP, nylon and polycarbonates 
are toughened by dispersing small rubber spheroids 
throughout the polymeric matrix; these particles con- 
centrate applied stresses and act as energy- absorbing 
sites of crazing. The toughened high-impact form of 
polystyrene is referred to as HIPS. PS, like PP, is nat- 
urally transparent but easily coloured. When supplied 
as expandable beads charged with a blowing agent, 
such as pentane, PS can be produced as a rigid heat- 
insulating foam. 

Some of the elastomers introduced in Chapter 2 will 
now be considered. Polyisoprene is natural rubber; 
unfortunately, because of reactive C-C links in the 
chains it does not have a high resistance to chemical 
attack and is prone to surface cracking and degradation 
(‘perishing’). Styrene-butadiene mbber (SBR), intro- 
duced in 1930, is still one of the principal synthetic 
rubbers (e.g. car tyres). 1 In this copolymer, repeat 
units of butadiene 2 are combined randomly with those 
of styrene. Polychloroprene (Neoprene), introduced in 
1932, is noted for its resistance to oil and heat and 
is used for automotive components (e.g. seals, water 
circuit pipes). 

As indicated previously, additives play a vital role in 
rubber technology. The availability of a large family of 

1 Synthetic (methyl) mbber was first produced in Germany 
during World War I as a result of the materials blockade; 
when used for tyres, vehicles had to be jacked-up overnight 
to prevent flat areas developing where they contacted the 
ground. 

2 Butadiene, or but-2-ene, is an unsaturated derivative of 

butane C 4 H 10 ; the central digit indicates that the original 
butene monomer C 4 H 8 contains two double bonds. 


rubbers has encouraged innovative engineering design 
(e.g. motorway bridge bearings, mounts for oil-rigs 
and earthquake-proof buildings, vehicle suspension 
systems). 

Silicone rubbers may be regarded as being inter- 
mediate in character to polymers and ceramics. From 
Table 2.7 it can be seen that the long chains con- 
sist of alternating silicon and oxygen atoms. Although 
weaker than organic polymers based upon carbon 
chains, they retain important engineering properties, 
such as resilience, chemical stability and electrical 
insulation, over the very useful temperature range of 
— 100°C to 300°C. These outstanding characteristics, 
combined with their cost-effectiveness, have led to the 
adoption of silicone rubbers by virtually every industry 
(e.g. medical implants, gaskets, seals, coatings). 

11.1.5 Management of waste plastics 

Concern for the world’ s environment and future energy 
supplies has focused attention on the fate of waste 
plastics, particularly those from the packaging, car 
and electrical/electronics sectors. Although recovery 
of values from metallic wastes has long been prac- 
tised, the diversity and often complex chemical nature 
of plastics raise some difficult problems. Nevertheless, 
despite the difficulties of re-use and recycling, it must 
be recognized that plastics offer remarkable properties 
and are frequently more cost- and energy-effective than 
traditional alternatives such as metals, ceramics and 
glasses. Worldwide, production of plastics accounts for 
about 4% of the demand for oil: transport accounts 
for about 54%. Enlightened designers now consider 
the whole life-cycle and environmental impact of a 
polymeric product, from manufacture to disposal, and 
endeavour to economize on mass (e.g. thinner thick- 
nesses for PE film and PET containers (‘lightweight- 
ing’)). Resort to plastics that ultimately decompose 
in sunlight (photodegradation) or by microbial action 
(biodegradation) represents a loss of material resource 
as they cannot be recycled; accordingly, their use tends 
to be restricted to specialized markets (e.g. agriculture, 
medicine). 

Figure 11.3c portrays the general pattern of plas- 
tics disposal in Western Europe. Landfilling is the 
main method but sites are being rapidly exhausted in 
some countries. The principal routes of waste man- 
agement are material recycling, energy recycling and 
chemical recycling. The first opportunity for material 
recycling occurs during manufacture, when uncontam- 
inated waste may be re-used. However, as in the case 
of recycled paper, there is a limit to the number of 
times that this is possible. Recycling of post-consumer 
waste is costly, involving problems of contamination, 
collection, identification and separation. 3 . Co-extruded 

3 German legislation requires that, by 1995, 80% of all 
packaging (including plastics) must be collected separately 
from other waste and 64% of total waste recycled as 
material. 
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blow-moulded containers are being produced with a 
three-layer wall in which recycled material is sand- 
wiched between layers of virgin polymer. In the Ger- 
man car industry efforts are being made to recycle 
flexible polyurethane foam, ABS and polyamides. New 
ABS radiator grilles can incorporate 30% from old 
recycled grilles. 

Plastics have a high content of carbon and hydrogen 
and can be regarded as fuels of useful calorific value. 
Incinerating furnaces act as energy-recycling devices, 
converting the chemical energy of plastics into ther- 
mal/electrical energy and recovering part of the energy 
originally expended in manufacture. Noxious fumes 
and vapours can be evolved (e.g. halogens); control 
and cleaning of flue gases are essential. 

Chemical recycling is of special interest because 
direct material recycling is not possible with some 
wastes. Furthermore, according to some estimates, 
only 20-30% of plastic waste can be re-used after 
material recycling. Chemical treatment, which is an 
indirect material recycling route, recovers monomers 
and polymer-based products that can be passed on as 
feedstocks to chemical and petrochemical industries. 
Hydrogenation of waste shows promise and is used to 
produce synthetic oil. 


11.2 Behaviour of plastics during 
processing 

11.2.1 Cold-drawing and crazing 

A polymeric structure is often envisaged as an entan- 
gled mass of chain molecules. As the T g values 
for many commercial polymers are fairly low, one 
assumes that thermal agitation causes molecules to 
wriggle at ambient temperatures. Raising the tempera- 
ture increases the violence of molecular agitation and, 
under the action of stress, molecules become more 
likely to slide past each other, uncoil as they rotate 
about their carbon- carbon bonds, and extend in length. 
We will first concentrate upon mechanistic aspects 
of two important modes of deformation; namely, the 
development of highly-preferred molecular orienta- 
tions in semi-crystalline polymers by cold-drawing and 
the occurrence of crazing in glassy polymers. 

Cold-drawing can be observed when a semi- 
crystalline structure containing spherulites is subjected 
to a tensile test at room temperature. A neck appears in 
the central portion of the test-piece. As the test-piece 
extends, this neck remains constant in cross-section 
but increases considerably in length. This process 
forms a necked length that is stronger and stiffer 
than material beyond the neck. At first, the effect of 
applied tensile stress is to produce relative movement 
between the crystalline lamellae and the interlamellar 
regions of disordered molecules. Lamellae that are 
normal to the direction of principal stress rotate in a 
manner reminiscent of slip-plane rotation in metallic 
single crystals, and break down into smaller blocks. 


These blocks are then drawn into tandem sequences 
known as microfibrils (Figure 11.4). The individual 
blocks retain their chain-folding conformation and are 
linked together by the numerous tie molecules which 
form as the original lamellae unfold. A bundle of 
these highly-oriented microfibrils forms a fibril (small 
fibre). The microfibrils in a bundle are separated 
by amorphous material and are joined by surviving 
interlamellar tie molecules. The pronounced molecular 
orientation of this type of fibrous structure maximizes 
the contribution of strong covalent bonds to strength 
and stiffness while minimizing the effect of weak 
intermolecular forces. 

Industrially, cold-drawing techniques which take 
advantage of the anisotropic nature of polymer crys- 
tallites are widely used in the production of synthetic 
fibres and filaments (e.g. Terylene). (Similarly, biax- 
ial stretching is used to induce exceptional strength in 
film and sheet and bottles e.g. Melinex). Crystalliza- 
tion in certain polymers can be very protracted. For 
instance, because nylon 6,6 has a T g value slightly 
below ambient temperature, it can continue to crystal- 
lize and densify over a long period of time during nor- 
mal service, causing undesirable after- shrinkage. This 
metastability is obviated by ‘annealing’ nylon briefly 
at a temperature of 120°C, which is below T m : less per- 
fect crystals melt while the more stable crystals grow. 
Stretching nylon 6,6 at room temperature during the 
actual freezing process also encourages crystallization 
and develops a strengthening preferred orientation of 
crystallites. 

Let us now turn from the bulk effect of cold-drawing 
to a form of localized inhomogeneous deformation, or 
yielding. In crazing, thin bands of expanded material 
form in the polymer at a stress much lower than the 
bulk yield stress for the polymer. Crazes are usually 
associated with glassy polymers (PMMA and PS) but 
may occur in semi-crystalline polymers (PP). They are 
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Figure 11.4 Persistence of crystalline block structure in 
three microfibrils during deformation. 
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several microns wide and fairly constant in width: they 
can scatter incident light and are visible to the unaided 
eye (e.g. transparent glassy polymers). As in the stress- 
corrosion of metals, crazing of regions in tension 
may be induced by a chemical agent (e.g. ethanol on 
PMMA). The plane of a craze is always at right angles 
to the principal tensile stress. Structurally, each craze 
consists of interconnected micro voids, 10-20 nm in 
size, and is bridged by large numbers of molecule- 
orientated fibrils, 10-40 nm in diameter. The voidage 
is about 40-50%. As a craze widens, bridging fibrils 
extend by drawing in molecules from the side walls. 
Unlike the type of craze found in glazes on pottery, it is 
not a true crack, being capable of sustaining some load. 
Nevertheless, it is a zone of weakness and can initiate 
brittle fracture. Each craze has a stress-intensifying 
tip which can propagate through the bulk polymer. 
Crazing can take a variety of forms and may even 
be beneficial. For instance, when impact causes crazes 
to form around rubber globules in ABS polymers, 
the myriad newly-created surfaces absorb energy and 
toughen the material. Various theoretical models of 
craze formation have been proposed. One suggestion 
is that triaxial stresses effectively lower T g and, when 
tensile strain has exceeded a critical value, induce a 
glass- to-rubber transition in the vicinity of a flaw, or 
similar heterogeneity. Hydrostatic stresses then cause 
microcavities to nucleate within this rubbery zone. 

As Figure 11.5 shows, it is possible to portray the 
strength/temperature relations for a polymeric material 
on a deformation map. This diagram refers to PMMA 
and shows the fields for cold-drawing, crazing, viscous 
flow and brittle fracture, together with superimposed 
contours of strain rate over a range of 10 -6 to 1 s _1 . 

11.2.2 Processing methods for thermoplastics 

Processing technology has a special place in the 
remarkable history of the polymer industry: polymer 
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Figure 11.5 Deformation map for PMMA showing 
deformation regions as a function of normalized stress 
versus normalized temperature (from Ashby and Jones, 
1986; permission of Elsevier Science Ltd, UK). 


chemistry decides the character of individual molecules 
but it is the processing stage which enables them to be 
arranged to maximum advantage. Despite the variety 
of methods available for converting feedstock pow- 
ders and granules of thermoplastics into useful shapes, 
these methods usually share up to four common stages 
of production; that is, (1) mixing, melting and homog- 
enization, (2) transport and shaping of a melt, (3) 
drawing or blowing, and (4) finishing. 

Processing brings about physical, and often chemi- 
cal, changes. In comparison with energy requirements 
for processing other materials, those for polymers are 
relatively low. Temperature control is vital because 
it decides melt fluidity. There is also a risk of ther- 
mal degradation because, in addition to having limited 
thermal stability, polymers have a low thermal con- 
ductivity and readily overheat. Processing is usually 
rapid, involving high rates of shear. The main methods 
that will be used to illustrate technological aspects of 
processing thermoplastics are depicted in Figure 11.6. 

Injection-moulding of thermoplastics, such as PE 
and PS, is broadly similar in principle to the pressure 
die-casting of light metals, being capable of produc- 
ing mouldings of engineering components rapidly with 
repeatable precision (Figure 11.6a). In each cycle, a 
metered amount (shot) of polymer melt is forcibly 
injected by the reciprocating screw into a ‘cold’ cavity 
(cooled by oil or water channels). When solidifica- 
tion is complete, the two-part mould opens and the 
moulded shape is ejected. Cooling rates are faster 
than with parison moulds in blow-moulding because 
heat is removed from two surfaces. The capital out- 
lay for injection-moulding tends to be high because of 
the high pressures involved and machining costs for 
multi-impression moulding dies. In die design, special 
attention is given to the location of weld lines, where 
different flows coalesce, and of feeding gates. Com- 
puter modelling can be used to simulate the melt flow 
and distributions of temperature and pressure within 
the mould cavity. This prior simulation helps to lessen 
dependence upon traditional moulding trials, which are 
costly. Microprocessors are used to monitor and con- 
trol pressure and feed rates continuously during the 
moulding process; for example, the flow rates into a 
complex cavity can be rapid initially and then reduced 
to ensure that flow-dividing obstructions do not pro- 
duce weakening weld lines. 

Extrusion is widely used to shape thermoplastics 
into continuous lengths of sheet, tube, bar, filament, 
etc. with a constant and exact cross-sectional profile 
(Figure 11.6b). A long Archimedean screw (auger) 
rotates and conveys feedstock through carefully pro- 
portioned feed, compression and metering sections. 
The polymer is electrically heated in each of the three 
barrel sections and Fictionally heated as it is ‘shear- 
thinned’ by the screw. Finally, it is forced through a 
die orifice. Microprocessor control systems are avail- 
able to measure pressure at the die inlet and to keep 
it constant by ‘trimming’ the rotational speed of the 
screw. Dimensional control of the product benefits 
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Figure 11.6 (a) Injection-moulding machine, (b) production 
of plastic pipe by extrusion and (c) thermoforming of plastic 
sheet ( from Mills, 1986; by permission of Edward Arnold). 

from this device. On leaving the die, the continuously- 
formed extrudate enters cooling and haul-off sections. 
Frequently, the extrudate provides the preform for a 
second operation. For example, in a continuous melt- 
inflation technique, tubular sheet of LDPE or HDPE 


from an annular die is drawn upwards and inflated 
with air to form thin film: stretching and thinning 
cease when crystallization is complete at about 120°C. 
Similarly, in the blow-moulding of bottles and air- 
ducting, etc., tubular extrudate (parison) moves ver- 
tically downwards into an open split-mould. As the 
mould closes, the parison is inflated with air at a pres- 
sure of about 5 atmospheres and assumes the shape of 
the cooled mould surfaces. Relatively inexpensive alu- 
minium moulds can be used because stresses are low. 

Thermoforming (Figure 11.6c) is another secondary 
method for processing extruded thermoplastic sheet, 
being particularly suitable for large thin-walled hol- 
low shapes such as baths, boat hulls and car bodies 
(e.g. ABS, PS, PVC, PMMA). In the basic version of 
the thermoforming, a frame-held sheet is located above 
the mould, heated by infrared radiation until rubbery 
and then drawn by vacuum into close contact with the 
mould surface. The hot sheet is deformed and thinned 
by biaxial stresses. In a high-pressure version of ther- 
moforming, air at a pressure of several atmospheres 
acts on the opposite side of the sheet to the vacuum 
and improves the ability of the sheet to register fine 
mould detail. The draw ratio, which is the ratio of 
mould depth to mould width, is a useful control param- 
eter. For a given polymer, it is possible to construct 
a plot of draw ratio versus temperature which can be 
used as a ‘map’ to show various regions where there 
are risks of incomplete corner filling, bursts and pin- 
holes. Unfortunately, thinning is most pronounced at 
vulnerable corners. Thermoforming offers an econom- 
ical alternative to moulding but cycle times are rather 
long and the final shape needs trimming. 

11.2.3 Production of thermosets 

Development of methods for shaping thermosetting 
materials is restricted by the need to accommodate a 
curing reaction and the absence of a stable viscoelas- 
tic state. Until fairly recently, these restrictions tended 
to limit the size of thermoset products. Compres- 
sion moulding of a thermosetting P-F resin ( Bakelite ) 
within a simple cylindrical steel mould is a well-known 
laboratory method for mounting metallurgical samples. 
Resin granules, sometimes mixed with hardening or 
electrically-conducting additives, are loaded into the 
mould, then heated and compressed until crosslinking 
reactions are complete. In transfer moulding, which 
can produce more intricate shapes, resin is melted in 
a primary chamber and then transferred to a vented 
moulding chamber for final curing. In the car indus- 
try, body panels with good bending stiffness are pro- 
duced from thermosetting sheet-moulding compounds 
(SMC). A composite sheet is prepared by laying down 
layers of randomly-oriented, chopped glass fibres, cal- 
cium carbonate powder and polyester resin. The sheet 
is placed in a moulding press and subjected to heat and 
pressure. Energy requirements are attractively low. 

Greater exploitation of thermosets for large car parts 
has been made possible by reaction injection-moulding 
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(RIM). In this process, polymerization takes place dur- 
ing forming. Two or more streams of very fluid chemi- 
cal reactants are pumped at high velocity into a mixing 
chamber. The mixture bottom-feeds a closed chamber 
where polymerization is completed and a solid forms. 
Mouldings intended for high- temperature service are 
stabilized, or post-cured, by heating at a temperature 
of 100°C for about 30 min. The reactive system in RIM 
can be polyurethyane-, nylon- or poly urea-forming. 
The basic chemical criterion is that polymerization in 
the mould should be virtually complete after about 
30 s. Foaming agents can be used to form compo- 
nents with a dense skin and a cellular core. When glass 
fibres are added to one of the reactants, the process is 
called reinforced reaction injection-moulding (RRIM). 
RIM now competes with the injection-moulding of 
thermoplastics. Capital costs, energy requirements and 
moulding pressures are lower and, unlike injection- 
mouldings, thick sections are not subject to shrinkage 
problems (‘sinks’ and voids). Cycle times for RIM- 
thermosets are becoming comparable with those for 
injection-moulded thermoplastics and mouldings of 
SMC. Stringent control is necessary during the RIM 
process. Temperature, composition and viscosity are 
rapidly changing in the fluid stream and there is a chal- 
lenging need to develop appropriate dynamic models 
of mass transport and reaction kinetics. 

11.2.4 Viscous aspects of melt behaviour 

Melts of thermoplastic polymers behave in a highly 
viscous manner when subjected to stress during pro- 
cessing. Flow through die orifices and mould chan- 
nels is streamline (laminar), rather than turbulent, 
with shear conditions usually predominating. Let us 
now adopt a fluid mechanics approach and consider 
the effects of shear stress, temperature and hydro- 
static pressure on melt behaviour. Typical rates of 
strain (shear rates) range from 10— 10 3 s -1 (extrusion) 
to 10 3 -10 5 s -1 (injection-moulding). When a melt is 
being forced through a die, the shear rate at the die 
wall is calculable as a function of the volumetric flow 
rate and the geometry of the orifice. At the necessarily 
high levels of stress required, the classic Newtonian 
relation between shear stress and shear (strain) rate 
is not obeyed: an increase in shear stress produces 
a disproportionately large increase in shear rate. In 
other words, the shear stress/shear rate ratio, which is 
now referred to as the ‘apparent shear viscosity’, falls. 
Terms such as ‘pseudo-plastic’ and ‘shear- thinning’ are 
applied to this non-Newtonian flow behaviour. 1 * The 
general working range of apparent shear viscosity for 
extrusion, injection-moulding, etc. is 10- 10 4 Ns m -2 . 
(Shear viscosities at low and high stress levels are 
measured by cone-and-plate and capillary extrusion 
techniques, respectively.) 

l In thixotropic behaviour, viscosity decreases with increase 

in the duration of shear (rather than the shear rate). 


Figure 11.7 shows the typical fall in apparent shear 
viscosity which occurs as the shear stress is increased. 
If Newtonian flow prevailed, the plotted line would 
be horizontal. This type of diagram is plotted for 
fixed values of temperature and hydrostatic pressure. A 
change in either of these two conditions will displace 
the flow curve significantly. Thus, either raising the 
temperature or decreasing the hydrostatic (bulk) pres- 
sure will lower the apparent shear viscosity. The latter 
increases with average molecular mass. For instance, 
fluidity at a low stress, as determined by the standard 
melt flow index (MFI) test, 1 is inversely proportional 
to molecular mass. At low stress and for a given 
molecular mass, a polymer with a broad distribution of 
molecular mass tends to become more pseudo-plastic 
than one with a narrow distribution. However, at high 
stress, a reverse tendency is possible and the version 
with a broader distribution may be less pseudo-plastic. 

Figure 11.8 provides a comparison of the flow 
behaviour of five different thermoplastics and is useful 
for comparing the suitability of different processes. It 
indicates that acrylics are relatively difficult to extrude 



Figure 11.7 Typical plot of apparent shear viscosity versus 
shear stress for LDPE at 210° C and atmospheric pressure: 
effects of increasing temperature T and hydrostatic pressure 
P shown ( after Powell, 1974; courtesy of Plastics Division, 
Imperial Chemical Industries Pic). 

^his important test, which originated in ICI laboratories 
during the development of PE, is used for most 
thermoplastics by polymer manufacturers and processors. 
The MFI is the mass of melt extruded through a standard 
cylindrical die in a prescribed period under conditions of 
constant temperature and compression load. 
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Figure 11.8 Typical curves of apparent shear viscosity 
versus shear stress for five thermoplastics at atmospheric 
pressure. A Extrusion- grade LDPE at 170° C; B 
extrusion- grade PP at 230 ° C; C moulding- grade acrylic at 
230 ° C; D moulding- grade acetal copolymer at 200 ° C; E 
moulding- grade nylon at 285° C (after Powell, 1974; 
courtesy of Plastics Division, Imperial Chemical Industries 
Pic.). 

and that PP is suited to the much faster deformation 
process of injection-moulding. In all cases, Newtonian 
flow is evident at relatively low levels of shear stress. 

The following type of power law equation has been 
found to provide a reasonable fit with practical data and 
has enabled pseudo-plastic behaviour to be quantified 
in a convenient form: 

r = Cy n (11.2) 

where C and n are constants. Now r = r]y, hence the 
viscosity 37 = Cy n ~ l . The characteristic term (n — 1) 
can be derived from the line gradient of a graphical 
plot of log viscosity versus log shear rate. In practice, 
the power law index n ranges from unity (Newtonian 
flow) to <0.2, depending upon the polymer. This index 
decreases in magnitude as the shear rate increases 
and the thermoplastic melt behaves in an increasingly 
pseudo-plastic manner. 

So far, attention has been concentrated on the vis- 
cous aspects of melt behaviour during extrusion and 
injection-moulding, with emphasis on shear processes. 
In forming operations such as blow-moulding and 
filament-drawing, extensional flow predominates and 
tensile stresses become crucial; for these conditions, 
it is appropriate to define tensile viscosity, the coun- 
terpart of shear viscosity, as the ratio of tensile stress 


to tensile strain rate. At low stresses, tensile viscosity 
is independent of tensile stress. As the level of tensile 
stress rises, tensile viscosity either remains constant 
( nylon 6 , 6 ), rises (LDPE) or falls (PP, HDPE). This 
characteristic is relevant to the stability of dimensions 
and form. For example, during blow-moulding, thin- 
ning walls should have a tolerance for local weak spots 
or stress concentrations. PP and HDPE lack this tol- 
erance and there is a risk that ‘tension-thinning’ will 
lead to rupture. On the other hand, the tensile viscos- 
ity of LDPE rises with tensile stress and failure during 
wall-thinning is less likely. 

11.2.5 Elastic aspects of melt behaviour 

While being deformed and forced through an extru- 
sion die, the melt stores elastic strain energy. As 
extrudate emerges from the die, stresses are released, 
some elastic recovery takes place and the extrudate 
swells. Dimensionally, the degree of swell is typically 
expressed by the ratio of extrudate diameter to die 
diameter; the elastic implications of the shear process 
are expressed by the following modulus: 

H = T/y R (11.3) 

where pi is the elastic shear modulus, r is the shear 
stress at die wall, and y R is the recoverable shear strain. 
The magnitude of modulus /x depends upon the poly- 
mer, molecular mass distribution and the level of shear 
stress. (Unlike viscosity, dependency of elasticity upon 
temperature, hydrostatic pressure and average molec- 
ular mass is slight.) If the molecular mass distribution 
is wide, the elastic shear modulus is low and elastic 
recovery is appreciable but slow. For a narrow distribu- 
tion, with its greater similarities in molecular lengths, 
recovery is less but faster. With regard to stress level, 
the modulus remains constant at low shear stresses but 
usually increases at the high stresses used commer- 
cially, giving appreciable recovery. 

The balance between elastic to viscous behaviour 
during deformation can be gauged by comparing the 
deformation time with the relaxation time or ‘natural 
time’ ( 1 ) of the polymer. X is the ratio of apparent 
viscosity to elastic shear modulus (37//X), and derives 
from the Maxwell model of deformation. The term vis- 
coelasticity originated from the development of such 
models (e.g. Maxwell, Voigt, standard linear solid 
(SLS)). The Maxwell model is a mechanical analogue 
that provides a useful, albeit imperfect, simulation of 
viscoelasticity and stress relaxation in linear polymers 
above T g (Figure 11.9). It is based upon conditions 
of constant strain. A viscously damped ‘Newtonian’ 
dashpot, representing the viscous component of defor- 
mation, and a spring, representing the elastic compo- 
nent, are combined in series. At time t, the stress o is 
exponentially related to the initial stress <t 0 , as follows: 

a = <j 0 exp(— t/X) (11.4) 

where X is the relaxation time. When t X, there 
is sufficient time for viscous movement of chain 
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Figure 11.9 Representation of stress relaxation under 
constant strain conditions ( Maxwell model). 


molecules to take place and stress will fall rapidly. 
When t <$; k, elastic behaviour predominates. The 
magnitude of k ranges from infinity, at the start of 
rubbery behaviour, to zero at the start of viscous 
behaviour. A real polymer contains different lengths of 
molecules and therefore features a spectrum of relax- 
ation times. Nevertheless, although best suited to poly- 
mers of low molecular mass, the Maxwell model offers 
a reasonable first approximation for melts. 

Let us now apply the relaxation time concept to 
an injection-moulding process in which a thermo- 
plastic acrylic at a temperature of 230°C is sheared 
rapidly at a rate of 10 5 s -1 . Assume that the injection 
(deformation) time is 2 s. For the shear rate given, 
Figure 11.8 indicates that the apparent shear viscosity 
is 9 Ns m -2 and the corresponding maximum shear 
stress is 0.9 MN m -2 . At this shear stress, the elastic 
shear modulus for acrylic is 0.21 MN m -2 . The value 
of k(= r]/ pi) is 43 ps, which is very small compared 
to the injection time of 2 s, hence viscous behaviour 
will predominate. A similar procedure can be applied 
to deformation by extrusion. For instance, PP with a 
relaxation time of 0.5 s might pass though the extru- 
sion die in 20 s. The time difference is smaller than 
the previous example of injection-moulding, indicating 
that although deformation is mainly viscous, elastic- 
ity will play a greater part than in injection-moulding. 
The previously-mentioned phenomenon of die swell 
then becomes understandable. (Swelling is equivalent 
to the spring action in the Maxwell model.) Although 
the degree of elastic behaviour may be relatively 
small during injection-moulding and extrusion, it can, 
nevertheless, sometimes cause serious flow defects. 

Relaxation times for extensional flow, as employed 
in blow-moulding, can be derived from the ratio of 
apparent tensile viscosity to elastic tensile modulus 
(E = <j/sr ). Suppose that a PP parison at a temper- 
ature of 230° C hangs for 5 s before inflation with 
air. If the tensile viscosity and tensile modulus are 


36 kN s m -2 and 4.6 kN m~ 2 , respectively, the relax- 
ation time is roughly 8 s. Hence sagging of the parison 
under its own weight will be predominantly elastic. 

11.2.6 Flow defects 

The complex nature of possible flow defects under- 
lines the need for careful product design (sections, 
shapes, tools) and close control of raw materials and 
operational variables (temperatures, shear rates, cool- 
ing arrangements). The quality of processing makes a 
vital contribution to the engineering performance of a 
polymer. 

Ideally, melt flow should be streamlined throughout 
the shaping process. If the entry angle of an extru- 
sion die causes an abrupt change in flow direction, the 
melt assumes a natural angle as it converges upon the 
die entry and a relatively stagnant ‘dead zone’ is cre- 
ated at the back of the die. In this region, the melt 
will have a different thermal history. In addition to its 
dominant shear component, the convergent flow con- 
tains an extensional component that increases rapidly 
during convergence. If the extensional stress reaches a 
critical value, localized ‘melt fracture’ will occur at a 
frequency depending upon conditions. The fragments 
produced recover some of the extensional strain. The 
effect upon the emerging extrudate can range from a 
matt finish to gross helical distortions. The associated 
flow condition is often termed ‘non-laminar’ despite 
the fact that the calculated value of the dimensionless 
Reynolds number is very low. The choice of entry 
angle for the die is crucial and depends partly upon 
the polymer. 

As a melt passes through the die, velocity gradients 
develop, with the melt near the die surface moving 
slower than the central melt. Upon leaving the die, 
the outer layers of extrudate accelerate, eliminating the 
velocity gradient. Above a critical velocity, the resul- 
tant stresses rupture the surface to give a ‘sharkskin’ 
effect which can range in severity from a matt finish 
to regular ridging perpendicular to the extrusion direc- 
tion. ‘Sharkskin’ is most likely when the polymer has 
a high average molecular mass (i.e. highly viscous) 
and a narrow molecular mass distribution (i.e. highly 
elastic); these factors cause surface stress to build up 
rapidly and to relax slowly. Fast extrusion at a low 
temperature favours this defect. Heating of the tip of 
the die lowers viscosity and reduces its likelihood. 

An inhomogeneous melt will produce a non-uniform 
recovery of elastic strain at the cooling surface and 
influence its final texture. Thorough mixing before 
shaping is essential. However, inhomogeneity may 
exist on a molecular scale. For instance, in both 
injection-moulding and extrusion, a broad distribution 
of molecular mass gives a more matt finish than a nar- 
row distribution. Thus, extrusion of a polymer with 
a narrow mass distribution at a rate slow enough to 
prevent the development of ‘sharkskin’ will favour a 
high-gloss finish. 



Plastics and composites 361 


Volatile constituents tend to vaporize, or ‘boil off, 
from the melt if processing temperatures are high. For 
instance, water vapour may derive from hygroscopic 
raw materials. Sometimes, a polymer degrades and 
releases a volatile monomer. Hydrostatic pressure usu- 
ally keeps the volatiles in solution; as the hot polymer 
leaves the die, this pressure is released and the escap- 
ing volatiles form internal bubbles and may pit the 
surface. 

When a polymer is heated to the processing temper- 
ature, the weak intermolecular forces are readily over- 
come by thermal vibrations. Its density may decrease 
by as much as 25%. The subsequent cooling can 
produce shrinkage defects, particularly in crystalline 
polymers, which assume more closely packed con- 
formations than amorphous polymers, and in thick 
sections. Polymers have a low thermal conductivity 
and the hot core can contract to produce depressions 
or ‘sink marks’ in the surface. If the surface layer 
cools rapidly, its rigidity can encourage internal voids 
to form. Careful product design can minimize shrink- 
age problems. Cooling under pressure after injection- 
moulding is beneficial. 

Orientation effects, which are common in shaped 
polymers, have a special significance when the poly- 
mer is ‘reinforced’ with short lengths of glass fibre. (It 
has been estimated that roughly half of the engineer- 
ing thermoplastics are fibre-filled.) In extruded pipes, 
fibres will tend to be aligned parallel to the extrusion 
direction and improve longitudinal strength. However, 
this orientation weakens the pipe transversely and the 
burst strength will suffer. In addition, the tubular form 
necessitates use of a ‘spider’ to support a central die, 
or mandrel, which causes the melt to split and coa- 
lesce before entering the die. The resultant weld lines 
introduce weakening interfaces. A die system is now 
available in which one or both dies are rotated. Their 
shearing action has the beneficial effect of inclining 
fibres at an angle to the extrusion direction. Weld lines 
are also reoriented so that they are placed in shear 
rather than tension when in service. 

11.3 Fibre-reinforced composite 
materials 

11.3.1 Introduction to basic structural 
principles 

11.3.1.1 The functions of filaments and matrix 
In engineering practice, where considerations of mec- 
hanical strength and stiffness are usually paramount, 
the term ‘composite’ is generally understood to refer 
to a material which combines a matrix phase with 
admixed filaments of a reinforcing (strengthening) 
phase. A composite derives from the essentially simple 
and practical idea of bonding together two or more 
homogeneous materials which have very different 
properties. Thus, in the glass-reinforced polymer 
(GRP) known generally as Fibreglass , large numbers 


of short, strong and stiff fibres of glass are randomly 
dispersed in a weaker but tougher matrix of thermoset 
resin. In general, the diameter of reinforcing filaments 
for composites is in the order of 10 fim. They may be 
continuous, extending the full length of a component, 
or short (discontinuous) and may share a common 
orientation, be randomly oriented or even woven into 
cloth. There is a statistical distribution of strength 
values among filaments made from brittle materials 
such as glass, boron and carbon. 

The term ‘composite’ is sometimes taken to include 
materials in which the second phase is in the form 
of particles or laminae. In such cases the composite 
structure may offer special advantages, other than 
strength, such as economy and corrosion-resistance 
(e.g. filler in plastics, plastic-coated steel sheet). The 
recorded history of technology contains numerous 
accounts of innovative, often remarkable, ideas for 
composites. 1 The present account primarily concerns 
the reinforcement of polymeric, metallic and ceramic 
matrices with fibres and ‘whisker’ crystals. Finally, on 
an even finer scale, we will consider application of 
the composite principle to micro structural constituents 
(i.e. nanocomposites). Relatively simple mechanical 
models will be used because they provide sufficient 
introduction to the ground rules of composite design 
and behaviour. 

Although fibres are a striking feature of a composite, 
it is initially helpful to examine the functions of the 
matrix. Ideally, it should be able to (1) infiltrate the 
fibres and solidify rapidly at a reasonable temperature 
and pressure, (2) form a coherent bond, usually 
chemical in nature, at all matrix/fibre interfaces, 
(3) envelop the fibres, which are usually very notch- 
sensitive, protecting them from mutual damage by 
abrasion and from the environment (chemical attack, 
moisture), (4) transfer working stresses to the fibres, 
(5) separate fibres so that failure of individual fibres 
remains localized and does not jeopardize the integrity 
of the whole component, (6) debond from individual 
fibres, with absorption of significant amounts of 
strain energy, whenever a propagating crack in the 
matrix chances to impinge upon them, and (7) remain 
physically and chemically stable after manufacture. 

11.3.1.2 Continuous-fibre composites 
In mechanical terms, the prime function of the matrix 
is to transfer stresses to the fibres (item (4) above) 
because these are stronger and of higher elastic 
modulus than the matrix. The response of a composite 
to applied stress depends upon the respective properties 
of the fibre and matrix phases, their relative volume 
fractions, fibre length and the orientation of fibres 
relative to the direction of applied stress. 

during the Battle of the Atlantic in World War II, Geoffrey 
Pyke proposed the construction of ocean-going aircraft 
carriers from paper pulp/frozen sea water ( Pykecrete ). In 
1985, ice/wood fibres ( Icecrete ) was proposed for wharf and 
off-shore oil platform construction in Norwegian waters. 
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Filaments aligned parallel to applied stress Filaments aligned transversely to applied stress 

Figure 11.10 ‘ Parallel ’ (a) and ‘ series' (b) models of unidirectional filament alignment in composites. 




Some basic principles of the elastic response to 
stress can be obtained from mechanical models in 
which continuous fibres are set unidirectionally in 
an isotropic, void-free matrix (Figures 11.10a and 
11.10b). It will be assumed that the Poisson ratio 
for the fibre material is similar to that of the matrix. 
Using subscript letters c, f, m, 1 and t we can signify 
where property values refer respectively to composite, 
fibre, matrix, longitudinal and transverse directions. 
Thus Vf/Vm is the ratio of volume fractions of fibre 
and matrix, where (1 — Vf) = V m . Certain longitudinal 
properties for a composite can be obtained by using the 
‘parallel’ model shown in Figure 11.10a and applying 
the Rule of Mixtures. For this condition of isostrain, 
stresses are additive and the equations for stress 

(strength) and elastic modulus are: 

+ (11.5) 

E cl = E n V f + E m V m (11.6) 

It is now possible to derive the following relation: 

(crn/crm) = (Vf/V m )(E n /E m ) (11.7) 

Figure 11.11 illustrates this relation, showing that 

as the modulus ratio and/or the volume fraction of 
fibres increase, more and more stress is transferred to 
the fibres. If the modulus ratio is unity, the composite 
must contain at least 50% v/v fibres if the fibres are 
to carry the same load as the matrix. Three typical 
composites A, B and C with 50% v/v reinforcement 
are superimposed on the graph to show the extent to 
which two increases in modulus ratio raise the stress 
ratio. 

An alternative arrangement of fibres relative to 
applied tensile stress is shown in Figure 11.10b. The 
transverse elastic modulus for the composite is given 
by the equation: 

(1 /E ct ) = (Vf/E ft ) + (Y m /E m ) (11.8) 

This ‘series’ version of the Rule assumes a condition 
of isostress and is derived by adding strains: it is less 
accurate than the ‘parallel’ version. Both versions can 
be used to calculate shear moduli and conductivities 
(thermal, electrical). More refined mathematical 
treatments are available: they are particularly helpful 
for transverse properties. Sometimes fibre properties 
are highly anisotropic and this feature influences the 
corresponding value for the composite; for instance, 
Ef t E{] for aramid (Kevlar, Nomex ) and carbon 
fibres. In general, cr fl cr m , £ fl E m and E ft > E m , 



E m 


Figure 11.11 Relation between modulus ratio and stress 
ratio (continuous parallel fibres). 

so that the equations for the ‘parallel’ and ‘series’ 
models express, in mathematical form, the dominant 
effect of fibres on longitudinal properties and the 
dominant effect of the matrix on transverse properties. 

If typical tensile stress versus strain curves for fibre 
and matrix materials (Figure 11.12a) are compared, 
it can be seen that the critical strain, beyond which 
the composite loses its effectiveness, is determined by 
the strain at fracture of the fibres, Sf. At this strain 
value, when the matrix has usually begun to deform 
plastically and to strain-harden, the corresponding 
stress on the matrix is cr' m . Thus, in the related 
Figure 11.12b, it follows that the strength of the 
composite lies between the limits a' m and <r f , depending 
upon the volume fraction of fibres. When a few widely - 
spaced fibres are present, the matrix carries more load 
than the fibres. Furthermore, in accordance with the 
Rule of Mixtures, the strength of the composite falls 
as the volume fraction of fibres deceases. Construction 
lines representing these two effects meet at a minimum 
point, Vmin* Obviously, Vf must exceed F cr i t if the 
tensile strength of the matrix is to benefit from the 
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Figure 11.12 (a) Stress-strain curves for filament and matrix 
continuous filaments. 

presence of fibres. In practical terms, the upper limit 
for Vf is about 0.7 -0.8. At higher values, fibres 
are likely to damage each other. The Rule is only 
applicable when Vf > V m in- 

At the critical volume for fibres, <r m = o c and Vf = 
Vcrit. From the Rule equation we derive: 

Vent = (or m - 0/(or f - cO (11.9) 

In general, a low V cr i t is sought in order to minimize 
problems of dispersal and to economize on the amount 
of reinforcement. Very strong fibres will maximize 
the denominator and are clearly helpful. Strain- 
hardening of the matrix (Figure 11.12a) is represented 
approximately by the numerator of the above ratio. 
Thus, a matrix with a strong tendency to strain-harden 
will require a relatively large volume fraction of fibres, 
a feature that is likely to be very significant for metallic 
matrices. For example, an fee matrix of austenitic 
stainless steel (Fe-18Cr-8Ni) will tend to raise V cr i t 
more than a eph matrix of zinc. 

11.3.1.3 Short-fibre composites 
So far, attention has been focused on the behaviour 
of continuous fibres under stress. Fabrication of 
these composites by processes such as filament- 
winding is exacting and costly. On the other hand, 
composites made from short (discontinuous) fibres 
enable designers to use cheaper, faster and more 
versatile methods (e.g. injection-moulding, transfer- 
moulding). Furthermore, some reinforcements are only 
available as short fibres. At this point, it is appropriate 
to consider an isolated short fibre under axial tensile 
stress and to introduce the idea of an aspect ratio 
(length/diameter = l/d). It is usually in the order of 
10 to 10 3 for short fibres: many types of fibre and 
whisker crystal with aspect ratios greater than 500 are 
available. For a given diameter of fibre, an increase 
in length will increase the extent of bonding at the 
fibre/matrix interface and favour the desired transfer 
of working stresses. As will be shown, it is necessary 
for the length of short fibres to exceed a certain critical 



and (b) dependence of composite strength on volume fraction of 


length 1 if efficient transfer of stress is to take place. 
With respect to diameter, fibre strength increases as the 
diameter of a brittle fibre is reduced. This effect occurs 
because a smaller surface area makes it less likely that 
weakening flaws will be present. 

For the model, in which a matrix containing a short 
fibre is subjected to a tensile stress (Figure 11.13a), 
it is assumed that the strain to failure of the matrix 
is greater than the strain to failure of the fibre. The 
differences in displacement between matrix and fibre 
thus cause shear stress, r, to develop at the cylindrical 
interfaces toward each fibre end. A corresponding 
tensile stress, a, builds up within the fibre. At each 
end of the fibre, these stresses change over a distance 
known as the stress transfer length (1/2); that is, 
the tensile stress increases as the interfacial shear 
stress decreases. In Figure 11.13a, for simplicity, we 
assume that the gradient of tensile stress is linear. If 
the length of the fibre is increased, the peak tensile 
stress coincides with fracture stress for the filament 
(Figure 11.13b). The total length of fibre now has a 
critical value of l c and the transfer length becomes 
IJ 2. If the length is sub-critical, fibre failure cannot 
occur. At the critical condition, the average tensile 
stress on the fibre is only <r f / 2. With any further 
increase in fibre length, a plateau develops in the stress 
profile (Figure 11.13c). The average tensile stress 
on the fibre, which is stated beneath Figure 11.13c, 
approaches the fracture strength <r f as the fibre length 
increases beyond its critical value. In effect, the load- 
carrying efficiency of the fibre is approaching that of 
its direct-loaded continuous counterpart. Provided that 
the shear stresses do not cause ‘pull-out’ of the fibre, 
fracture will eventually occur in the mid-region of 
the fibre. 

The condition of critical fibre length can be 
quantified. Suppose that an increment of tensile force, 
8a, is applied to an element of fibre, 81. The balance 

l A. Kelly introduced the concept of ‘critical fibre length’. 
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= a f / 2 



Figure 11.13 Distribution of tensile stress in a short fibre. <Tf = fracture stress of fibre in tension, o — mean tensile stress. 


between tensile force and interfacial shear force is: 

8a(jTd 2 /4) = t(t rd.8l) (11.10) 

Hence the gradient 8a/8l for the build-up of 
tensile stress is 4r /d. For the critical condition 
(Figure 11.13b), the gradient is <r f divided by the 
critical transfer length / c /2. The critical length l c is 
therefore <r f d/2r. Expressed in terms of the critical 
aspect ratio, the criterion for efficient stress transfer 
takes the form: 

l c /d > <j f /2r (11.11) 

This relation provides an insight into the capabilities 
of short-fibre composites. For instance, for a given 
diameter of fibre, if the interfacial shear strength of 
the fibres is lowered, then longer fibres are needed 
in order to grip the matrix and receive stress. When 
the operating temperature is raised and the shear 
strength decreases faster than the fracture strength 
of the fibres, the critical aspect ratio increases. The 
presence of tensile and shear terms in the relation 
highlights the indirect nature of short- fibre loading: 
matrix strength and interfacial shear strength are 
much more crucial factors than in continuous-fibre 
composites where loading is direct. Interfacial shear 
strength depends upon the quality of bonding and 
can have an important effect upon the overall impact 
resistance of the composite. Ideally, the bond strength 
should be such that it can absorb energy by debonding, 
thus helping to inhibit crack propagation. Interfacial 
adhesion is particularly good in glass fibre/polyester 
resin and carbon fibre/epoxy resin systems. Coupling 
agents are used to promote chemical bonding at the 
interfaces. For example, glass fibres are coated with 
silane (size) which reacts with the enveloping resin. 
Sometimes these treatments also improve resistance to 
aqueous environments. However, if interfacial bonding 
is extremely strong, there is an attendant risk that an 


impinging crack will pass into and through fibres with 
little hindrance. 

Provided that Vf > F cr j t , the tensile strength at 
fracture of a short-fibre reinforced composite can be 
calculated from the previous Rule of Mixtures equation 
by substituting the mean tensile stress term for the 
fracture stress, as follows: 

a c = a(l-l c /2l)V f + a' m V m (11.12) 

It is assumed that the shear strength r remains constant 
and that fibres are perfectly aligned. First, the equation 
shows that short fibres strengthen less than continuous 
ones. For example, if all fibres are ten times the 
critical length, they carry 95% of the stress carried by 
continuous fibres. On the other hand, if their length 
falls below the critical value, the strength suffers 
seriously. For instance, if the working load on the 
composite should cause some of the weaker fibres 
to fail, so that the load is then carried by a larger 
number of fibres, the effective aspect ratio and the 
strength of the composite tend to fall. The same type 
of equation can also be applied to the elastic modulus 
of a short-fibre composite; again the value will be less 
than that obtainable with its continuous counterpart. 
The equation also shows that matrix properties become 
more prominent as fibres are shortened. 

11.3.1.4 Effect of fibre orientation on strength 
Let us reconsider composites in which continuous 
fibres are aligned in the same direction as the 
direction of applied stress. The strength of this highly 
anisotropic type of composite varies with the volume 
fraction of fibres in a linear manner, as shown in 
Figure 11.12b. If the fibres are now oriented at an 
angle 0 to the direction of applied tensile stress, the 
general effect is to reduce the gradient of the strength 
curve for values of Vf greater than V m in, as shown in 
Figure 11.14. This weakening effect is represented by 
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Figure 11.14 Effect of fibre orientation on strength of 
unidirectional composite ( continuous fibres . . 
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inserting an orientation factor 37 in the basic strength 
equation to give: 

v c = WfVf + a' m V m (11.13) 

As 0 increases from zero, rj falls below unity. 

In order to provide a more detailed analysis of the 
variation of composite strength with fibre orientation, 
it is customary to apply a ‘maximum stress’ theory 
based on the premise that there are three possible 
modes of composite failure. Apart from the angle of 
fibre orientation 0, three properties of the composite 
are invoked: the strength parallel to the fibres <r fl , the 
shear strength of the matrix parallel to the fibres r m , 
and the strength normal to the fibres a ft . Each mode 
of failure is represented by an equation which relates 
the composite strength <r d to a resolved stress. 

For the first mode of failure, which is controlled by 
tensile fracture of the fibres, the equation is 

a ci = <r fl sec 2 0 (11.14) 

The equation for failure controlled by shear on a plane 
parallel to the fibres is: 

<r c i = 2r m cosec 20 (11.15) 

As the temperature is raised, this mode of failure 
becomes more likely in off-axis composites because 
the shear strength r m falls more rapidly than <r fl . 

In the third mode of failure, transverse rupture 
occurs, either in the matrix or at the fibre/matrix 
interface (debonding). The relevant equation is: 

<r c i = <T ft cosec 2 0 (11.16) 

Figure 11.15 shows the characteristic form of the 
relation between composite strength and fibre orienta- 
tion. While illustrating the highly anisotropic character 
of the unidirectional continuous reinforcement, it also 
shows the merit of achieving low values of 0. Predic- 
tions, using the maximum stress theory, and experi- 
mental results are in good agreement and confirm the 
general validity of this curve. (Measured values of a fl , 
r m and a ft are required for these calculations.) The 


Figure 11.15 Relation between failure mode, strength and 
fibre orientation ( schematic diagram for unidirectional 
continuous -fibre composite ). 

mode of failure is decided by the equation which gives 
the lowest value of composite strength <r cl . Thus, trans- 
verse rupture becomes dominant when 0 is large. At 
relatively low values of 0, there is a rapid fall in com- 
posite strength which is associated with the transition 
from tensile failure in fibres to failure by shear. Combi- 
nation of the first two of the three equations eliminates 
cr c i and gives the critical angle for this transition: 

<£crit = tan _1 (r m /(Tfl) (11-17) 

If the longitudinal strength is about ten times the shear 
strength of the matrix, then the angle is about 6°. 

When an application involves applied stresses 
that are not confined to one direction, the problem 
of anisotropy can often be effectively solved or 
minimized by using continuous fibres in the form of 
woven cloth or laminates. Although these forms are 
more isotropic than unidirectional composites, there is 
inevitably a slight but usually tolerable loss in strength 
and stiffness. Glass, carbon and aramid fibres are used; 
sometimes two or more different types of fibre are used 
in combination (hybrid composites). Fibre cloth in a 
variety of weave patterns is available. In a nominally 
two-dimensional sheet of woven cloth there is a certain 
amount of fibre oriented in the third dimension. A more 
truly three-dimensional reinforcement, with improved 
through-thickness properties, can be obtained by 
placing woven cloths on top of each other and stitching 
them together with continuous fibre. 

Faminates based on carbon and aramid fibres are 
commonly used for high-performance applications 
that involve complex stress systems (e.g. twisting, 
bending). The unit of construction is a thin ply 
of unidirectional composite, 50-130 pm thick. Plies 
are carefully stacked and oriented with respect to 
orthogonal reference axes (0° and 90°). The simplest 
lay-up sequence is (0/90/90/0). Other more isotropic 
sequences are (0/ + 45/ — 45/ — 45/ + 45/0) and 
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(0/ + 60/ — 60/ — 60/ + 60/0). The stacking of plies 
is symmetrical about the mid-plane of the laminate in 
order to prevent distortion and to ensure a uniform 
response to working stresses. 

Randomly-oriented short fibres of glass are com- 
monly used in sheets and in three-dimensional mould- 
ings. In fact, fibre misorientation is quite common in 
composites, frequently being an unavoidable result of 
fabrication. For instance, when resins loaded with short 
fibres are injection-moulded, the mixture follows com- 
plex flow paths. If the finished moulding is sectioned, 
fibres will clearly indicate the patterns of flow: these 
patterns are determined by melt viscosity, mould con- 
tours and processing conditions. Fortunately, flow pat- 
terns are repeatable from moulding to moulding. Close 
to the surface of the moulding, the short fibres tend to 
follow streamline paths; in the central core, where flow 
is more turbulent, fibres are likely to assume transverse 
orientations. 

11.3.2 Types of fibre-reinforced composite 

11.3.2.1 Polymer-matrix composites 
Glass-reinforced polymers (GRP) date from the 
early 1940s 1 and were the forerunners of present- 
day polymer-, metal- and ceramic-matrix com- 
posites. A typical fabrication procedure for a 
GRP is to add a mixture of polyester resin, 
curing agent and catalyst to fibres of low- 
alkali E-glass (typically 53Si0 2 -18Ca0-14Al 2 0 3 - 
10B 2 O 3 -5MgO). The thermosetting reactions of curing 
are then allowed to take place at a temperature below 
150°C. The relatively low cost, stiffness and ease of 
fabrication of GRP led to their widespread adoption in 
engineering, even for large structures (e.g. storage tanks 
and silos, mine counter-measure vessels). 

Glass fibres for composites are made by allow- 
ing molten glass to pass through the nozzles of an 
electrically heated bushing made from Pt-lORh alloy. 
(The number of nozzle holes in the base of the bush- 
ing is 204, or a multiple of 204.) The filaments 
emerge at a velocity of 50-100 ms -1 and are rapidly 
cooled with a water mist to prevent crystallization, 
hauled over a ‘size’ applicator and finally collected 
by a rotating cylinder (collet). ‘Sizing’ applies a coat- 
ing which loosely bonds the fibres, protects their 
fragile glass surfaces from damage and introduces a 
surface-modifying ‘coupling’ agent to promote even- 
tual fibre/matrix bonding. The primary bundle of con- 
tinuous, untwisted fibres is the unit of collection from 
the bushing and is known as a strand. That is, it con- 
tains 204, 408, 816 or more fibres. (The equivalent 
unit for carbon fibres is called a tow.) Strands can be 
combined to form a larger bundle (roving). Strands or 
rovings are used for unidirectional composites and as 
yarn for weaving; alternatively, they can be chopped 

1 First used in quantity for aircraft radomes which required 
strength, low electrical and thermal conductivity and 
‘transparency’ to radar waves. 


into short lengths of 25-50 mm and oriented ran- 
domly in a plane, (e.g. chopped strand mat for press 
moulding). Interstage ‘sizing’ is often practised. The 
range of fibre diameters is 5-20 pm. Although origi- 
nally developed for electrical applications, E-glass is 
the principal material for the production of continuous 
glass fibres. Other compositions are available, giving a 
higher modulus of elasticity (H-modulus glass), greater 
tensile strength (S -glass), or better alkali resistance 
(AR-glass), etc. 

The principal methods for fabricating polymer- 
matrix composites are (1) hand lay-up or spray-up, 
(2) press-moulding with heated matching dies, (3) vac- 
uum moulding, (4) autoclave moulding, (5) resin 
transfer moulding, (6) reinforced reaction injection 
moulding (RRIM), (7) pultrusion, and (8) filament 
winding. Broadly, these listed methods either bring 
fibres and ‘wet’ resin together during processing or 
use pre-impregnated shapes (pre-pregs) in which fibres 
and thermoset resin are pre-combined. Pre-pregs are 
made by infiltrating rovings, mats or woven fabrics 
with resin and then heating in order to initiate partial 
curing. This B-stage of polymerization is preserved 
until processing by storing at a low temperature. Pre- 
pregs facilitate control of infiltration, orientation and 
fibre content, making it possible to produce polymer- 
matrix composite (PMC) of repeatable quality on an 
automated mass-production basis. In each of the above 
methods there is need to prevent air or vapours becom- 
ing trapped in the composite and forming weakening 
voids. Voids form preferentially at fibre/matrix inter- 
faces and between the plies of layered composites. 
Low-viscosity resins, outgassing and high pressures 
are some of the means used to minimise this porosity. 

Polyester-based matrices remain the principal 
choice for polymer-matrix composites. In the 1970s, 
polyesters became available in the form of moulding 
compounds that are particularly suitable for hot press- 
moulding. Dough moulding compounds (DMC) and 
sheet moulding compounds (SMC) contain roughly 
equal volume fractions of polyester resin, inert filler 
particles and chopped glass fibres. When heated, 
these compounds rapidly become fluid, reproduce the 
contours and details of the moulding dies accurately 
and then cure. This technique is used for domestic 
articles, panels and doors of vehicles, cabinets for 
office and electronic equipment, etc. 

As part of the search for cheaper fabrication meth- 
ods, much effort has been devoted to the develop- 
ment of PMCs with thermoplastic matrices (e.g. nylon 
66, PP, PTFE, PET, polyether sulphone (PES), etc). 
For example, water boxes of car radiators and shell 
housings for street lamps have been made from a 
composite of 33% glass fibres in a nylon 66 matrix 
0 Maranyl ). In the early 1980s, pre-pregs made of 
poly ether ether ketone (PEEK) reinforced with PAN- 
derived carbon fibres ( APC2 ) became available: how- 
ever, they are costly and use tends to be restricted 
to highly- specialized applications (e.g. aircraft com- 
ponents). The general advantages of a thermoplastic 
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matrix are its toughness, indefinite shelf life and, in 
the absence of curing, a shorter time cycle for fab- 
rication. However, during the necessary heating, the 
viscosity of the matrix is higher than that of a ther- 
moset resin, making infiltration between fibres more 
difficult. Furthermore, the pre-pregs are stiff and lack 
the drapability of thermoset pre-pregs which enables 
them to bend easily into shape. 

For the exacting requirements of aerospace and 
high-performance aircraft, the principal PMCs have 
epoxide matrices reinforced with continuous fibres of 
carbon or aramids (Kevlar). The basic advantage of 
epoxides is that they can be used at higher service 
temperatures than polyester matrices. Although the 
T g value of a polymer provides an indication of its 
temperature ceiling, it is substantially higher than the 
maximum temperature for safe service under load. 
For instance, the maximum temperature for a load- 
bearing epoxide matrix is about 160°C, whereas the 
corresponding T g values lie in the range 200- 240° C 
(depending upon the method of determination). The 
search for matrices with a higher temperature capabil- 
ity has led to the development of bismaleimides (BMI) 
and polyimides (PI). These and other new polymers 
raised the ceiling temperature closer to 200° C but have 
sometimes introduced a brittleness problem and they 
can be difficult to process. 

Carbon fibre reinforced polymers (CFRPs) are 
firmly established as construction materials for 
specialized and demanding applications (e.g. helicopter 
rotors, monocoque chassis of racing cars, aircraft floor 
panels, spacecraft components, sports goods, high- 
speed loom components). Laminates of continuous 
carbon fibres (Grafil) are widely used. Carbon fibres 
are also used in metal-matrix and ceramic -matrix 
composites. Frequently, they are combined with other 
types of fibre to form hybrid composites (e.g. glass 
and carbon, aramid ( Kevlar ) and carbon). Carbon 
fibres, 5-10 pm diameter, are available in untwisted 
tows containing 1000, 3000, 6000, 12000 or 120000 
filaments and as pre-pregs with resin. In the UK and 
the USA they are mostly produced from the textile 
polyacrylonitrile (PAN) and its copolymers. Many 
types of PAN-derived carbon fibre are produced (e.g. 
commodity, high-modulus, high-extension, etc.). 

The three-stage process 1 for manufacturing carbon 
fibre is based on the controlled degradation or pyrol- 
ysis of spun fibres of PAN. Hot- stretching is a central 
feature of processing: it counteracts the tendency of the 
fibres to shrink and induces a high degree of molecular 
orientation. The tow is first oxidized under tension at a 
temperature of 200° C and forms a stable, crosslinked 
‘ladder’ structure. In the second stage, heating in an 
inert atmosphere at temperatures between 800° C and 
1600°C carbonizes the structure, releasing vapours and 
gases (hydrogen, nitrogen) and reducing the original 
mass by 40-50%. Finally, the oriented carbon fibres 

1 Originally developed at the Royal Aircraft Establishment 
(RAE), Farnborough, in the 1960s over a five-year period. 


form graphite crystallites during heat-treatment at tem- 
peratures in the range 1300°C to more than 2000°C. 
Raising this treatment temperature encourages graphi- 
tization and improves the elastic modulus but lowers 
the strain to failure. Finally, the fibres are surface- 
treated (e.g. electrolytic oxidation) to improve subse- 
quent bonding to the matrix and ‘sized’ to facilitate 
handling. 

Each carbon fibre produced is very pure and con- 
sists essentially of interwoven ‘ribbons’ of turbostratic 
graphite (Figure 11.16a) and some amorphous carbon. 
The ribbons are aligned parallel to the fibre axis. Being 
an imperfect structure, the amount of porosity is appre- 
ciable. In general terms, a - axes of the planar crystal- 
lites are parallel to the fibre axis, the other a-axis is 
radial or circumferential, and the c-axes are perpendic- 
ular to the fibre axis. As the structure becomes more 
truly graphitic, the ribbon orientation approaches that 
of the fibre axis and the axial modulus increases. The 
fibre structure is highly anisotropic: the moduli of elas- 
ticity along the fibres and perpendicular to the fibres 
are 200-800 GN m -2 and 10-20 GN m -2 , respec- 
tively. (The modulus for E-glass is about 73 GN m -2 .) 

Another textile, rayon, is used as a precursor for 
carbon fibres, but to a lesser extent than PAN. Alter- 
natively, melt-spun filaments of high-purity, meso- 
phase pitch can be oxidized and pyrolized in a process 
broadly similar to the PAN process to yield car- 
bon fibres with a very high modulus approaching 
1000 GN m -2 . Pitch-derived carbon fibre is expensive, 
more difficult to handle than PAN-derived fibres and 
its use is confined to specialized applications. 

Aramid fibres (Kevlar 29 and 49, Twaron ) based 
on aromatic polyamides are important reinforcements 
for polymers. Their linear molecular structure 
(Figure 11.16b) is produced from spun fibre by a 
process of drawing and heating under tension at 
a temperature of approximately 550°C. This linear 
structure, which contrasts with the more planar 
structure of carbon fibres, gives aramid fibres a fibrillar 
character and they can absorb considerable amounts 
of impact energy. When struck with a projectile, 
aramid fibres split into numerous microfibrils, giving 
exceptional ‘stopping power’. This property has 
led to the use of aramid fibres and aramid/resin 
laminates for ballistic applications (e.g. armour). 
The elastic modulus is 50-130 GN m -2 and it is 
stable at temperatures approaching 400° C (depending 
upon the environment). Their mechanical properties 
are degraded by ultraviolet radiation; nevertheless, 
aramid fibres are widely used, particularly in hybrid 
composites. 

The rapid deceleration of racing cars and landing 
aircraft develops very high frictional forces and tem- 
peratures in braking systems: this challenge has been 
met by composites in which a carbonaceous matrix is 
reinforced with carbon fibres. These carbon- carbon 
composites combine the refractory potential of car- 
bon with the high specific strength/stiffness of car- 
bon fibres. PAN or pitch are used as precursors when 
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(a) Fibre axis 
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(b) Fibre axis 



Weak H-bonds Pleated crystals 


Figure 11.16 Structure of (a) carbon fibre and (b) aramid fibre ( from Hughes, June 1986, pp. 365-8; by permission of the 
Institute of Materials) . 


high-modulus fibres are required. Chemical vapour 
deposition (CVD) can be used to produce the matrix 
phase: hydrocarbon gas infiltrates the carbon fibres and 
is thermally ‘cracked’ to form a matrix of pyrolytic 
graphite. C-C composites maintain their strength at 
high temperatures and have high thermal conductiv- 
ity and excellent friction/wear characteristics. They are 
used for aircraft and racing car brakes, rocket noz- 
zles and the heat shield of the Space Shuttle. In the 
presence of oxygen, C-C composites begin to oxi- 
dize and sublime at relatively low temperatures, say 
400° C. Efforts are being made to develop long-life 
multilayer coatings for this type of composite that will 
inhibit oxygen diffusion, ‘self-heal’ and enable service 
temperatures in oxidising environments to be raised to 
1400-1500°C. 


11.3.2.2 Metal-matrix composites 
Large-scale research and development studies of 
metal-matrix composites (MMCs) date back to the 
1960s, being stimulated at that time by the new 
availability of carbon and boron fibres, whisker 
crystals and, more indirectly, by successes achieved 
with reinforced polymer- matrix composites. The aero- 
space and defence industries were attracted by the 
prospects of a new type of constructional material 
possessing high specific strength/stiffness. The fall 
in strength produced by rising temperature is more 
gradual in MMCs than with unreinforced matrix 
material, promising higher service temperatures. In 
addition, the MMC concept offers prospects of unique 
wear resistance and thermal expansion characteristics. 
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For example, parts of space structures are required 
to maintain dimensional stability while being cycled 
through an extremely wide range of temperature. 
One criterion of performance is called the thermal 
deformation resistance, which is the ratio of thermal 
conductivity to the coefficient of thermal expansion. 
The expansion coefficient of graphite fibres with a 
very high elastic modulus is negative, which makes 
it feasible to design continuous-fibre MMCs with a 
zero coefficient of thermal expansion. 

Typical early versions of MMC were boron fibres 
in titanium and carbon fibres in nickel. Over the 
years, interest has ranged from continuous and dis- 
continuous fibres to whisker crystals and particles. 
The nominal ranges of diameter for continuous fibres, 
short fibres and whiskers, and particles are 3-140 pm, 
0-20 pm and 0.5-100 pm. Continuous unidirectional 
fibre-reinforcement gives the greatest improvement in 
properties over those of unreinforced matrix material. 
Interest in whisker reinforcement, once very great, 
has tended to decline because of the carcinogenic 
risks associated with handling small whiskers during 
manufacture and composite fabrication. At present, 
particle-reinforced MMCs find the largest industrial 
application, being essentially isotropic and easier to 
process. Matrices based upon low-density elements 
have gradually come into prominence. The principal 
matrix materials are aluminium and its alloys, titanium 
and its alloys, and magnesium. Most MMCs are based 
on aluminium and its alloys. In recent years, cheaper 
particles and fibres of silicon carbide have become 
available, making them the commonest choice of rein- 
forcement material. Alumina ( Saffil ) reinforcement is 
also used in many MMCs. 

Figure 11.17 illustrates the considerable improve- 
ments in specific tensile strength (longitudinal) and 
specific modulus that result when aluminium alloys are 
reinforced with fibres, whiskers or particles. Property 
changes of this magnitude are unlikely to be achieved 
by more orthodox routes of alloy development. At 
the same time, as the diagram shows, these changes 
are accompanied by rising costs and bring anisotropy 
into prominence. The ratio of longitudinal strength to 
transverse strength can be 15:1 or more for MMCs. 
Although continuous-fibre reinforcement can confer 
maximum unidirectional strength, service conditions 
frequently involve multi-axial stresses. 

A wide and growing variety of methods is avail- 
able for producing MMCs, either as components or as 
feedstock for further processing (e.g. billets for extru- 
sion, rolling, forging). Many of these methods are still 
on a laboratory or development scale. In general terms, 
they usually involve either melting of the matrix metal, 
powder blending or vapour/electrodeposition. Particle- 
reinforced aluminium matrix composites can be made 
by (1) pressing and sintering blends of pre-alloyed 
powder and reinforcement particles (powder blending), 
(2) mechanical alloying (MA), (3) mixing particles 
with molten metal (melt- stirring), (4) compocasting 
(rheocasting) and (5) spray co-deposition. 


Typical reinforcements: 

A = SiC particulate or whisker 
B = SiC or B monofilament 



Figure 11.17 Range of longitudinal tensile specific 
strengths and moduli achieved in aluminium-based 
composites (from Feest, May 1988, pp. 273-8; by 
permission of the Institute of Materials). 

Powder blending techniques (1) are favoured, rather 
than melting, when the metallic matrix has a high 
melting point, thus minimizing the fibre/matrix interac- 
tion problem. After the critical blending operation, the 
MMC powder is canned, vacuum degassed and consol- 
idated by hot-pressing or HIPing. Finally, the MMC is 
worked and shaped (e.g. extrusion, forging). Particle 
volume fractions of 0.25-0.50 are typical. Mechan- 
ical alloying is broadly similar to powder blending; 
the essential difference is that alloying of pure metal 
powders is achieved in high-speed ball mills. In the 
melt-stirring technique (3), the presence of particles 
raises the melt viscosity. Possible difficulties include 
non- wetting of particles, agglomeration and/or gravity- 
settling of particles and unwanted particle/metal inter- 
actions. In the related compocasting process, the stirred 
melt is maintained in a two-phase ‘mushy’ state at tem- 
peratures between solidus and liquidus. The method is 
not applicable to metallic systems with a narrow range 
of solidification. 

Production of particle-reinforced MMCs by spray 
co-deposition (5) is based upon a versatile process 1 
originally developed for building deposits of steels 
that are difficult to cast, nickel-based superalloys 
and copper. A stream of induction-melted metal or 
alloy is broken into fine droplets by relatively cold 
inert gas (nitrogen). Droplets begin to freeze before 
striking a movable collector placed in the line of 
flight. In the MMC variant, reinforcement particles are 

Patented by Osprey Metals, adopted under licence and 
further developed by Alcan International Ltd, Banbury. 
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injected into this stream. Fine-grained MMC deposits 
of SiC/aluminium alloy and Al 2 0 3 /steel can be built as 
plate, tube, billets for hot- working, cladding, etc. The 
pathogenic risks associated with certain types of fine 
particles, fibres and whiskers have necessarily been 
taken into consideration. Filaments of 0.1-3 pm diam- 
eter and with lengths greater than 5 pm are hazardous. 
Accordingly, SiC particle sizes exceed a certain thresh- 
old (e.g. 5 pm). 

The fabrication of continuous-fibre reinforced 
MMCs is difficult and technically demanding. Fibre 
preforms with a high volume fraction of fibres are 
difficult to ‘wet’ and infiltrate; on the other hand, if 
Vf is low, the preform will lack the ‘green’ strength 
required for handling. A high melt temperature will 
lower viscosity and assist infiltration of the fibres 
but increases the risk of fibre/metal reactions. (SiC, 
A1 2 0 3 and carbon fibres react with aluminium alloy 
matrices at a temperature of 500°C.) There is also a 
risk that secondary working operations will damage 
the fibres. In general, the methods for continuous 
fibres are costly, give low production rates and limit 
the size and shape of the MMC product. Typical 
methods for producing continuous-fibre MMCs are 
(1) diffusion bonding (DB), (2) squeeze-casting, and 
(3) liquid pressure forming (LPF). 

Diffusion bonding (1) can be used to consolidate 
metal with continuous fibres of SiC, A1 2 0 3 , boron and 
carbon. Fibres are pre-coated with matrix material or 
carbon. The process conditions must achieve a delicate 
balance between promotion of solid-state diffusion and 
limitation of fibre/matrix reactions. DB has been used 
to produce boron/aluminium struts for the NASA space 
shuttle and SiC/titanium alloy composites. 

In principle, squeeze-casting (2) is a single- shot 
process combining gravity die-casting with closed- 
die forging. It is mainly used for discontinuous-fibre 
MMCs. A metered charge of melt containing short 
fibres is fed into the lowermost of a pair of dies 
and then compressed at high pressure (35-70 MN 
m -2 ) by the descending upper die. The pressure is 
maintained while the charge solidifies. Interpretation of 
the relevant phase diagram for the matrix has to allow 
for the pressure variable. Boundaries in the diagram 
are shifted and, for alloys that contract on freezing, 
the liquidus temperature is raised. With aluminium 
alloys, this shift is about 10-25°C. Thus, the applied 
pressure has an undercooling effect which, together 
with the loss of heat through the dies, favours rapid 
solidification. The high pressure also discourages the 
nucleation of gas bubbles. The final matrix structure 
is accordingly fine-grained and dense. Moderate- sized 
components can be produced at high rates. 

In liquid pressure forming (3), a preheated fibre 
preform is placed within split dies. The dies are closed 
and air pressure in the cavity reduced below 1 mb. 
Molten aluminium casting alloy under pressure is then 
infiltrated upwards through the preform and allowed 
to solidify. LPF uses a lower pressure (<8 MN m -2 ) 
than squeeze-casting and there is less limitation on 


size and shape of product. Particle-reinforced MMCs 
can be produced. Using a single preform and one 
injection shot, it is possible to vary the volume fraction 
and type of fibre (continuous, discontinuous) within a 
component. 

With regard to applications, the emphasis in the 
aerospace industry is the development of MMCs based 
upon titanium or intermetallic compounds such as TiAl 
and Ti 3 Al. For example, coated SiC fibres are used 
to reinforce Ti-6A1-4V alloy. Proposed applications 
for MMCs include compressor discs and blades in 
aero-engines, engine pylons and stabilizers. Channel 
extrusions made from discontinuous-fibre reinforced 
MMC are in use for electrical racking in aircraft. 
In general manufacturing, there is interest in using 
SiC fibre-reinforced aluminium for critical components 
operating at very high speeds where high specific 
stiffness would be mechanically advantageous (e.g. 
carpet-making, food packaging, textile production). 
Co- sprayed steel matrix deposits have been used for 
components of plant handling highly- abrasive materi- 
als (e.g. coke, minerals, wood, fibreboard). 

In the car industry MMCs are now accepted as 
candidate materials for valve rocker arms, connect- 
ing rods, gear selector forks, pulleys, propshafts, etc. 
They have also made it possible to replace cast 
iron engine cylinder blocks with selectively-reinforced 
blocks of aluminium alloy (Honda). The hybrid com- 
posite, which contains 12% v/v alumina particles (for 
strength) and 9% v/v carbon fibres (for lubricity), saves 
weight, improves the power rating and dispenses with 
cast iron cylinder liners. 

In diesel engines 2 the achievement of higher 
combustion efficiencies, with better fuel economy and 
reduction in the exhaust emissions of undesirable 
gases and particulate matter, has resulted in higher 
temperatures and peak pressures in the combustion 
chambers. The standard alloy for diesel pistons is 
Al- 12Si- lCu- lMg (LM 13 in BS1490: Lo-Ex). 
Although this eutectic alloy is satisfactory for a 
working temperature range of 250-300°C, advances 
in engineering design have raised crown temperatures 
to 300- 350° C. For instance, in one relatively new 
design feature, now established for direct-injection 
engines, a deep combustion bowl is located in the 
crown of each piston. This turbulence-inducing cavity 
usually has re-entrant angles and sharp edges. The 
higher mechanical and thermal loading on the bowl lip 
can cause cracking. This problem has been solved by 
squeeze-casting the piston body and, at the same time, 
incorporating a fibre-reinforced bowl. A controlled 
quantity of aluminium alloy melt is fed into an 
open die, which contains a preformed compact of 
alumina fibres, and then compressed with a pressure 
of 120-150 MN m -2 by a hydraulically-actuated plug 
die, so that the fibres are infiltrated, ‘wetted’ and 

2 The first engine patent of Rudolf Diesel (1858-1913) was 
officially authenticated in 1893. 
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bonded to the alloy. (The fibres are 2-4 pm diameter, 
200-500 pm long, and occupy at least 20% v/v of 
the composite) Squeeze-casting prevents shrinkage and 
dissolved gases from causing microporosity and gives 
a fine-grained structure with better high-temperature 
fatigue properties than gravity-casting. The MMC 
structure provides the bowl lips with the necessary hot 
strength and resistance to cracking. 

Figure 11.18 shows a typical aluminium alloy piston 
design, with local reinforcement of the combustion 
bowl and the top ring groove. Accepted practice for 
the large pistons of heavy-duty and turbocharged diesel 
engines is to reinforce this groove, and possibly the 
second groove, with a wear-resistant insert of nickel- 
rich, and hence austenitic, cast iron ( Ni-Resist ) which 
is bonded to the aluminium alloy body by the Alfin 
process. In small high-speed indirect-injection diesel 
engines, which have a pre-combustion chamber, lighter 
and cheaper MMC inserts have been used successfully 
to reinforce the groove. The performance of this insert 
is crucial because modern designs are tending to locate 
the top groove much closer to the crown, even in the 
‘headland’ position, thus reducing the volume of the 
annular ‘dead space’ above the ring and giving fuel 
economy and reduced emissions. 

When considered in terms of the costly effort that 
has been put into development of MMCs, commercial 
exploitation has been disappointingly limited. MMC 
structures and components face strong competition 
from alternative, more conventional engineering mate- 
rials. Exploitation has been mainly hindered by the 
high cost of reinforcement materials, particularly con- 
tinuous fibres, and fundamental features of MMC 


I 



Figure 11.18 Localized MMC reinforcement of piston head 
(courtesy of R. Munro and AE Piston Products, T & N 
Group). 


structures that tend to make fabrication difficult and 
service performance questionable. 

During fabrication, formation of the matrix often 
necessitates a high processing temperature (e.g. melt 
infiltration). This temperature can be high enough to 
promote unwanted chemical reactions at the interfaces 
between matrix and reinforcement. These conditions 
may arise during service at elevated temperatures, of 
course. SiC-reinforced titanium is an example of a 
composite in which brittle interfacial products tend to 
form (i.e. silicides and carbides). Ideally, the aim is 
to develop a strong interfacial bond without degrading 
the fibre or forming weakening phases at the interfaces. 
The coating of fibres is generally regarded as the most 
promising means to control chemical interaction at 
reinforcement/matrix surfaces. However, it is difficult 
to develop a coating with long-term stability. 

As might be anticipated, the deliberate introduction 
of short stress-concentrating fibres into a metal or alloy 
tends to reduce the ductility and toughness below that 
of the unreinforced matrix. When a short-fibre MMC 
is deformed by applied stress, the amount of strain in 
fibre and matrix may differ substantially, leading to 
rupture of the interfacial bonding at the end regions 
of fibres. This debonding results in the nucleation and 
rapid growth of voids. Void formation is the dominant 
mode of tensile failure in SiC/aluminium composites. 
A difference in the coefficients of thermal expansion of 
fibre and matrix can produce the same effect, possibly 
during the cooling stages of composite fabrication or 
during thermal excursions and cycling in service. Thus, 
because off < a m , fibres and particles develop residual 
compressive stress and the matrix is left in a state 
of tension. This disparity in expansion characteristics 
between the reinforcement phase and the matrix also 
helps to generate a high density of dislocations in 
the matrix. Plastic deformation of the matrix involves 
dislocation movement across slip planes. If a slip 
plane is threaded by fibres, a glissile dislocation will 
bow between them and recombine beyond the fibres, 
leaving an Orowan loop around each fibre. These loops 
reduce the effective gap between fibres and successive 
dislocations are forced to assume smaller radii of 
curvature as they bypass the fibres. In combination 
with a high dislocation density and a fine matrix grain 
size, this mechanism can produce a high rate of initial 
strain-hardening in the matrix. 

The presence of extremely hard reinforcement par- 
ticles or fibres can benefit wear-resistance in service 
but leads to difficulties during the finish-machining 
operations that are usually needed for MMC products. 
These particles act as chip-breakers during machin- 
ing of materials, such as SiC-reinforced aluminium 
alloys, and reduce cutting forces: unfortunately, they 
also shorten tool life significantly, even when tools are 
tipped with poly crystalline diamond (PCD). Machin- 
ing tends to generate sub- surface damage, a matter of 
special concern in the preparation of mechanical test- 
pieces. 
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Despite these formidable problems, the basic idea 
of reinforcing a strong deformable matrix with elastic 
fibres retains its appeal to the aerospace, defence and 
automotive industries and active research on MMCs 
continues worldwide. There is a need to expand and 
consolidate the database for properties of MMCs (e.g. 
fracture toughness, fatigue resistance, corrosion resis- 
tance, etc.). 1 

11.3.2.3 Ceramic-matrix composites 
Reinforcement of cements and concretes with short fil- 
aments of glass, steel or carbon is applied in the build- 
ing and construction industries. Costs are very closely 
scrutinized in these industries and the increase in cost 
associated with fibre reinforcement is often regarded 
as unacceptable unless there is promise of exceptional 
properties. In one example, a 2% v/v dispersion of 
steel wires (low-carbon or austenitic stainless) in a 
concrete matrix ( Wirand Concrete 2 ) confers improved 
resistance to crack propagation and greater flexural 
and compressive strength. These wires are typically 
0.25-0.5 pm diameter, with an aspect ratio of 100:1. 
The resistance of this composite to salt water and to 
pebble abrasion have led to its use for 40 t twin-fluked 
shapes (dolosse) for shore defences against heavy seas. 

Since the 1980s there has been a sustained inter- 
est in the development of ceramic-matrix compos- 
ites (CMCs) for high-temperature applications that 
demand exceptional strength and chemical stability 
(e.g. advanced engine designs, cutting tools). Ceramics 
in general are inherently notch- sensitive and brittle: the 
introduction of fibres is primarily intended to improve 
the toughness of the matrix and reliability in service. 
(In polymer-matrix and metal-matrix composites, rein- 
forcement is used to improve strength, stiffness and, 
if possible, toughness.) Currently, the principal matri- 
ces are inorganic glasses and glass-ceramics but it 
is also possible to use engineering ceramics, such as 
alumina and silicon nitride. Reinforcement materials, 
which include silicon carbide ( Nicalon , Tyranno ), alu- 
mina and carbon, have taken the form of continuous 
fibres, short fibres, whisker crystals, woven fibre cloth 
and laminates. 

Failure in CMC structures is complex in character 
and a variety of mechanisms has been identified. It is 
not possible to apply directly the theory of fracture 
mechanics that was developed for monolithic ceramic 
bodies. Researches on CMC systems using continuous 
fibres primarily aim to achieve substantial improve- 
ments in toughness and reliability. To this end, there 
is particular interest in structures that have a capac- 
ity for accumulating damage and that eventually fail 

l ln the early 1980s, MMCs were classed as strategic 
materials by the US government: samples and detailed 
information are consequently unavailable. 

2 Patent based on researches of Professor James P. Romualdi 
of the Carnegie-Mellon University and the Battelle 
Development Corporation, USA, in the 1960s). 
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Figure 11.19 Stages of failure in CMC reinforced with 
continuous fibres (from King, Dec 1989, pp. 720-6; by 
permission of the Institute of Materials). 

in a gradual (‘graceful’) manner, rather than suddenly 
and catastrophically. This overload capability is illus- 
trated schematically in Figure 11.19, which gives the 
tensile stress versus strain curve for a unidirectional 
composite. It will be noted that the fracture strain 
of the unreinforced matrix is considerably less than 
that of the composite. Superimposed on the curve 
for the composite are the principal micromechanisms 
of failure, namely (1) microcracking of the matrix, 

(2) interfacial failure as fibre pull-out takes place and 

(3) fibre fracture. 

In general terms, events taking place in the wake of 
each crack tip become more significant than flaws in 
the matrix. Thus, unbroken fibres bridge the cracks in 
the matrix, oppose opening separation of these cracks 
and reduce the effective stress intensity factor at the tip 
of the crack. As the matrix cracks, load is increasingly 
transferred to the fibres. A high E f /E m ratio and a high 
content of fibres favour this transfer. Volume fractions 
up to 0.5 -0.6 are quoted in reports. 

Debonding at the fibre/matrix interfaces is clearly 
an important aspect of the overall fracture process. 
Strong bonding assists the transfer of load from matrix 
to fibres and increases the amount of energy absorbed 
during fibre pull-out. However, if this bond strength 
is exceptionally strong, cracks may travel in an unhin- 
dered, brittle manner through matrix and fibres alike, 
with the CMC behaving like a monolithic ceramic. On 
the other hand, if bonding is relatively weak, the course 
of an incident crack may be deflected into the interfa- 
cial zone, with useful absorption of energy. For these 
reasons, special attention is given to ways of control- 
ling and/or modifying the surface character of fibres. 

The degree to which the linear coefficients of ther- 
mal expansion (a) of fibre and matrix materials match 
is an important aspect of CMC design. For instance, 
a mismatch between a-values can favour the type of 
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crack-deflection process just described. Again, if short 
fibres are used and there is a substantial mismatch in 
o'-values, cracks will tend to be initiated at the ends of 
the fibres. Thermal cycling will aggravate this type of 
damage. A mismatch in the expansion characterisics 
of well-bonded fibres and matrix can also introduce a 
‘pre-stressing’ effect that inhibits crack growth. Thus, 
if off > a m , then cooling from a high temperature will 
induce residual tensile stress in the fibres and residual 
compressive stress in the matrix. This imbalance has 
a toughening effect because microcracks advancing in 
the matrix have to overcome compressive stress. 

Scanning electron microscopy provides valuable 
information on deformation and failure mechanisms 
in CMC structures. A sample of composite can be 
imaged in situ with high resolution while being sub- 
jected to tensile, bending, compressive or cyclic stress. 
Figure 11.20a shows a SiC fibre in a matrix of cal- 
cium aluminosilicate during an early stage of fatigue 
failure, with the fibre bridging the microcrack in the 
matrix. Eventually, as Figure 11.20b shows, the fibre 
itself fractures after a certain amount of pull-out from 
the matrix. 

Fabrication of CMCs is expensive, demanding very 
close control. During fabrication of a continuous fibre 
CMC, it is necessary to infiltrate the fibres, which may 
be in the form of woven cloth, with a fluid form of the 
matrix-building material. The most common method 
uses a slurry of ceramic particles, a carrier liquid 
(water or alcohol), an organic binder and, sometimes, 
a surface-wetting agent. After drying, the shape is hot- 
pressed. A range of alternative production routes is 
now available (e.g. sol -gel reaction, melt infiltration, 
chemical vapour deposition (CVD), reaction- sintering, 
transfer-moulding). Unique fabrication problems can 
arise. For instance, during high-temperature firing, 
chemical reaction at the fibre/matrix interfaces may 
cause weakening intermediate phases to form. With 
regard to industrial utilization, there is still a widely- 
accepted need to improve toughness and reliability 
further. Although some CMCs can be used in air 
at temperatures approaching 1000°C, oxidation is a 
potential problem when carbon and SiC fibres are 
used; fibre oxidation may occur at temperatures as 
low as 350-400°C. Coatings have proved effective in 
preventing this wasting attack and in raising the ceiling 
temperature for the composite. Alternatively, an inert 
gaseous environment may be used but is obviously not 
always feasible. Glassy matrices are favoured because 
processing is generally simpler; unfortunately, it is 
possible for certain glasses to begin to soften in the 
temperature range 500-800°C. 

As the foregoing remarks indicate, a number of 
problems are associated with the production and uti- 
lization of ceramic-matrix composites. Commercial 
exploitation of CMCs to date has consequently been 
rather restricted. Combinations that have found appli- 
cations in engineering are SiC whiskers/polycrystalline 
alumina (cutting-tool tips), graphite fibres/borosilicate 



(b) 

Figure 11.20 Field- emission scanning electron micrographs 
of fatigue cracking in SiC fibre/C a aluminosilicate composite 
(fibre diameter 15 pm) (courtesy of M. D. Halliday, 
Interdisciplinary Research Centre, University of 
Birmingham, UK). 

glass (substrate of laser mirrors), and graphite/glass- 
ceramic (bearings, seals and brakes). The SiC 
fibre/lithium aluminosilicate (LAS) glass composite is 
a candidate material for new types of heat-engine. 

11.3.2.4 In-situ composites and nanocomposites 
Alloy microstructures, such as eutectics and eutectoids, 
containing a dispersion of fibres or lamellae within a 
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matrix phase are well-known products of phase trans- 
formation. They form an intimate combination of two 
or more quite different phases and, understandably, 
there has been strong interest in developing in-situ 
(self-assembled) composites which offer the advan- 
tages of the orthodox composites produced by adding 
a matrix phase to preformed fibres. This new class 
of materials, with its micron-scale structures, offers 
the prospect of unique physical (electrical, thermal, 
magnetic), mechanical and chemical properties. Pro- 
vided that the alloy system is carefully chosen and 
manipulated, it is possible to produce unidirection- 
ally solidified structures with a uniform distribution 
of aligned fibres over extended distances. The prob- 
lem of fibre/matrix interaction so often associated with 
conventional fabrication routes is eliminated. Eutectic 
systems are immediately attractive because the solid 
phases produced simultaneously by this type of reac- 
tion are essentially in thermodynamic equilibrium and 
are held together by strong interfacial bonds. Fibres 
are preferred to lamellae because the latter are more 
likely to be weakened by the presence of transverse 
cracks. 

In-situ composites are commonly produced by the 
classical laboratory technique in which molten alloy is 
withdrawn vertically downwards from a furnace held 
at a constant temperature. The rate of withdrawal is 
necessarily very low for some alloys (<10 mm h -1 ), 
increasing the risk that the materials of containment 
will contaminate the melt. Because of the low produc- 
tion rates, costs tend to be high. As the eutectic reaction 
proceeds, solid phases nucleate and grow perpendicu- 
lar to the horizontal melt/eutectic interface in a cooper- 
ative manner. Interdiffusion of alloying elements takes 
place in the melt adjacent to this advancing interface. 
If both matrix and reinforcement phases have a low 
entropy of fusion, the eutectic structure grows in a 
uniform manner and the interface is essentially planar. 
Fortunately, most alloy systems of industrial interest 
tend to freeze with this morphology when solidifi- 
cation is closely controlled, even in cases where the 
reinforcing phase (e.g. carbide) has a known tendency 
to form crystallographically oriented facets because of 
its high entropy of fusion. When both phases have a 
high entropy of fusion, each phase has a strong ten- 
dency to form facets and the final structure is likely 
to be very irregular with much branching. This last 
combination is intrinsically unsuitable for producing 
in-situ composites. 

Severe limitations on phase composition and phase 
ratio may apply to these melt- grown composites. For 
instance, binary alloy systems can often only develop 
a relatively small volume fraction of a strong fibrous 
phase. In the Ni-Al system, which has been exten- 
sively studied, the volume fraction of strong Al 3 Ni 
filaments within the aluminium-based matrix is 0.1. 
(Beyond a certain volume fraction in a given system, 
laminae have a lower interfacial energy and tend to 
form in preference to fibres.) Complex ternary alloy 
systems are more promising: a monovariant eutectic 


reaction at a ‘valley’ line in the liquidus surface of the 
Co-Cr-C system can produce 30% v/v carbide fibres 
in a solid solution matrix of cobalt and chromium. 
For high-temperature applications, such as turbine 
blades in aero engines, in-situ composites can compete 
with directionally-solidified (DS) nickel-based superal- 
loys and single-crystal superalloys (e.g. stress-rupture 
properties in directions parallel to the reinforcement). 
Aligned two-phase structures can be grown from the 
alloy melt; phase compositions can be varied quite 
widely without preventing such structures from form- 
ing. There is also valuable scope for alloying additions 
(Cr, Al, W, Re) which enhance properties such as 
resistance to oxidation, corrosion, creep and thermal 
fatigue. Examples are the NITAC and COTAC series 
of eutectic superalloys, which are based upon nickel 
and cobalt, respectively. In typical NITAC alloy, the 
ductile matrix consists of y-Ni and a fine precipitate of 
Ni 3 Al particles (gamma-prime phase, y '), and is rein- 
forced with a carbide phase (e.g. TaC, Cr 3 C 2 , NbC). 

In service, the integrity of in-situ composites can be 
jeopardized by oxidative attack, temperature gradients 
and/or thermal cycling. The last condition is potentially 
disruptive because a mismatch in the thermal expan- 
sion characteristics of the two phases will cause stiff 
fibres to fracture and the ductile matrix phase to fail 
by fatigue. 

In the late 1980s, researchers began to extend the 
composite principle beyond the micron scale and into 
the nanometre range (1-50 nm). The ‘jumps’ in vari- 
ous properties achieved with these new nanocomposite 
microstructures were considerable. An early prototype 
was a nominally single-phase ceramic structure based 
upon a conventional poly crystalline matrix of micron- 
sized alumina grains. A uniform dispersion of 5% sili- 
con carbide inclusions, about 10 nm in size, throughout 
this matrix gives a fourfold increase in strength. The 
processing route for this ceramic is based upon estab- 
lished sintering technology: the main stages are the 



Figure 11.21 SiC inclusion/alumina nanocomposite ( from 
Brook and MacKenzie, Jan 1993, pp. 27-30; by permission 
of the Institute of Materials). 



Plastics and composites 375 


grinding and mixing of fine /3 - SiC and a-A\ 2 0 3 pow- 
ders, hot-pressing (1750°C) and annealing (1300°C). 
The structural character of the ceramic is shown 
schematically in Figure 11.21. It has been proposed 
that the large number of dislocations present form a 
sub- structure within each grain, with SiC particles at 
the nodal points. In general terms, this combination of 
micron- sized grains and much finer nanometre- scale 
sub- structures gives exceptional strength and, impor- 
tantly in the case of ceramics, good reproducibility. 

Study of nanoscale structures has been extended to 
a wide variety of metallic systems. Again, fabrication 
is based on pure, ultrafine powders. These powders 
are mainly produced by either condensation from a 
vapour, which minimizes the risk of contamination, or 
high-energy ball-milling. Two or more metal powders 
are compacted at high pressure (1-5 GN trr 2 ) to 
give a unique nanometre- scale grain structure. An 
extremely large proportion of atoms are sited at or near 
grain boundary surfaces and properties dependent upon 
grain boundary characteristics come into prominence. 
For instance, these multiphase metallic nanostructures 
are stable and highly resistant to grain growth. The 
grains are randomly-oriented and fairly uniform in 
size (say, 5-10 nm). Contiguous grains can differ in 
chemical composition. 

Ceramic and metallic nanocomposites can display 
unusual properties and behaviour that are not explica- 
ble in terms of conventional scientific theory. Thus cur- 
rent explanations of dislocation behaviour, diffusion, 
solid solubility and fracture in nanoscale structures are 
often tentative. 
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Chapter 12 


Corrosion and surface engineering 


12.1 The engineering importance of 
surfaces 

The general truth of the engineering maxim that ‘most 
problems are surface problems’ is immediately appar- 
ent when one considers the nature of metallic corrosion 
and wear, the fatigue-cracking of metals and the effect 
of catalysts on chemical reactions. For instance, with 
regard to corrosion, metal surfaces commonly oxi- 
dize in air at ambient temperatures to form a very 
thin oxide film (tarnish). This ‘dry’ corrosion is lim- 
ited, destroys little of the metallic substrate and is not 
normally a serious problem. However, at elevated tem- 
peratures, nearly all metals and alloys react with their 
environment at an appreciable rate to form a thick non- 
protective oxide layer (scale). Molten phases may form 
in the scale layer, being particularly dangerous because 
they allow rapid two-way diffusion of reacting species 
between the gas phase and the metallic substrate. ‘Wet’ 
or aqueous corrosion, in which electrochemical attack 
proceeds in the presence of water, can also destroy 
metallic surfaces and is responsible for a wide variety 
of difficult problems throughout all branches of indus- 
try. The principles and some examples of ‘dry’ and 
‘wet’ corrosion will be discussed in Section 12.2. 

Conventionally, the surface properties of steels are 
improved by machining to produce a smooth sur- 
face texture (superfinishing), mechanically working 
(shot-peening), introducing small atoms of carbon 
and/or nitrogen by thermochemical means (carbur- 
izing, nitriding, carbo-nitriding), applying protective 
coatings (galvanizing, electroplating), chemically con- 
verting (anodizing), etc. Many of these traditional 
methods employ a liquid phase (melt, electrolyte). In 
contrast, many of the latest generation of advanced 
methods for either coating or modifying material sur- 
faces use vapours or high-energy beams of atoms/ions 
as the active media. Their successful application on a 
commercial scale has revealed the merits of developing 
a new philosophy of surface design and engineering. In 


Section 12.3, we examine some typical modem meth- 
ods for improving surface behaviour. 

12.2 Metallic corrosion 

12.2.1 Oxidation at high temperatures 

12.2.1.1 Thermodynamics of oxidation 

The tendency for a metal to oxidize, like any other 
spontaneous reaction, is indicated by the free energy 
change AG accompanying the formation of the oxide. 
Most metals readily oxidize because AG is negative 
for oxide formation. The free energy released by the 
combination of a fixed amount (1 mol) of the oxidizing 
agent with the metal is given by A G° and is usually 
termed the standard free energy of the reaction. A G° 
is, of course, related to A H°, the standard heat of 
reaction and AS 0 the standard change in entropy, by 
the Gibbs equation. The variation of the standard free 
energy change with absolute temperature for a number 
of metal oxides is shown in Figure 12.1. The noble 
metals which are easily reduced occur at the top of the 
diagram and the more reactive metals at the bottom. 
Some of these metals at the bottom (Al, Ti, Zr), 
however, resist oxidation at room temperature owing 
to the impermeability of the thin coherent oxide film 
which first forms. 

The numerical value of A G° for oxidation reactions 
decreases with increase in temperature, i.e. the oxides 
become less stable. This arises from the decreased 
entropy accompanying the reaction, solid (metal) + 
gas (oxygen) -> solid (oxide). The metal and oxide, 
being solids, have roughly the same entropy values 
and d(AG)/dT is thus almost equivalent to the entropy 
of the oxygen, i.e. 209.3 J deg -1 mol -1 . Most of 
the AG versus T lines therefore slope upwards at 
about this value, and any change in slope is due to 
a change in state. As expected, melting has a small 
effect on AS and hence AG, but transitions through 
the boiling point (e.g. ZnO at 970° C) and sublimation 
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Figure 12.1 Standard free energies of formation of oxides. 


(e.g. Li 2 0 at 1330°C) have large effects. Exceptions from one mole of oxygen and hence AS — 0; the C0 2 

to the positive slope of the AG versus T line occur free energy line is thus almost horizontal. The carbon 

for carbon oxidation to CO or C0 2 . In both cases the monoxide reaction is favoured at high temperatures 

oxide product is gaseous and thus also has a high free and consequently carbon is a very effective reducing 

energy. In the reaction 2C + 0 2 2CO, two moles agent, having a greater affinity for oxygen than most 

of gas are produced from one of oxygen so that oxides. 

Because of the positive slope to the A G° versus T 
AS = (Soxide — ^carbon — ^oxygen) line f or mos t oxides in Figure 12.1, A G° tends to 

— (os° .. — s° u — <>° ^ zero at some elevated temperature. This is known 

as the standard dissociation temperature when the 
The CO free energy versus temperature line has a oxide is in equilibrium with the pure element and 

downward slope of approximately this value. For the oxygen at 1 atm pressure. In the case of gold, the 

C + 0 2 -> C0 2 reaction, one mole of C0 2 is produced oxide is not stable at room temperature, for silver 
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Ag 2 0 dissociates when gently heated to about 200° C, 
and the oxides of the Pt group of metals around 
1000°C. The other oxides dissociate at much higher 
temperatures. However, the temperature is affected 
by pressure since the free energy per mole of any 
gaseous phase varies with pressure P(atm) according 
to G(P) = G° + RTlnP, whereas that for the solid 
phase is relatively unaffected. Thus, for the metal + 
oxygen -> metal oxide reaction under standard con- 
ditions, A G = G oxide G metal G oxygen? and at P 
atm oxygen, AG = A G° — RrinP 02 - The reaction is 
in equilibrium when AG = 0 and hence 

p 02 = exp[AG°/Rr] (12.2) 

is the equilibrium dissociation pressure of the oxide 
at the temperature T. If the pressure is lowered below 
this value the oxide will dissociate, if raised above, the 
oxide is stable. The common metal oxides have very 
low dissociation pressures ~10 _10 N/m 2 (~10 -15 atm) 
at ordinary annealing temperatures and thus readily 
oxidize in the absence of reducing atmospheres. 

The standard free energy change A G° is also related 
to the equilibrium constant K of the reaction. For the 
reaction discussed above, i.e. 

Me + 0 2 -> Me0 2 

the equilibrium constant K = [Me0 2 ]/[Me][0 2 ] deri- 
ved from the law of mass action. The active masses 
of the solid metal and oxide are taken equal to unity 
and that of the oxygen as its partial pressure under 
equilibrium conditions. The equilibrium constant at 
constant pressure, measured in atmospheres is thus 
K P = l/Po 2 - It then follows that A G° = — RTlnKp. 
To illustrate the use of these concepts, let us consider 
the reduction of an oxide to metal with the aid of a 
reducing agent (e.g. Cu 2 0 by steam). For the oxidation 
reaction. 

4Cu + 0 2 — > 2 Cu 2 0 

and from Figure 12.1 at 1000 K, AG° = —0.195 MJ/ 
mol = l/Po 2 , giving Pq 2 = 6.078 x 10 -6 N/m 2 . At 
1000 K the equilibrium constant for the steam reaction 
2H 2 0 — > 2H 2 T 0 2 is 

K = Po 2 Pl 2 /Pl 2 o = 1-013 x 10 -15 N/m 2 

Thus to reduce Cu 2 0 the term P Ql < 6 x 1CU 11 in 
the steam reaction gives Fh 2 /Fh 2 o > 10 -5 , so that 
an atmosphere of steam containing 1 in 10 5 parts of 
hydrogen is adequate to bright-anneal copper. 

In any chemical reaction, the masses of the reactants 
and products are decreasing and increasing respec- 
tively during the reaction. The term chemical potential 
/x(= dG/dn) is used to denote the change of free 
energy of a substance in a reaction with change in 
the number of moles n , while the temperature, pres- 
sure and the number of moles of the other substances 
are kept constant. Thus, 

li, = tf + RTlnP, (12.3) 


and the free energy change of any reaction is equal to 
the arithmetical difference of the chemical potentials 
of all the phases present. So far, however, we have 
been dealing with ideal gaseous components and pure 
metals in our reaction. Generally, oxidation of alloys is 
of interest and we are then dealing with the solution of 
solute atoms in solvent metals. These are usually non- 
ideal solutions which behave as if they contain either 
more or less solute than they actually do. It is then 
convenient to use the activity of that component, a t , 
rather than the partial pressure, P h or concentration, q. 
For an ideal solution P t = P9q, whereas for non-ideal 
solutions Pi = P^cii, such that a t is an effective con- 
centration equal to the ratio of the partial, or vapour, 
pressure of the ith component above the solution to its 
pressure in the standard state. The chemical potential 
may then be rewritten 

/q- = /I 0 . + RTlnq (12.4) 

where for an ideal gas mixture a t = P t /P ( - and by 
definition P9 = 1. For the copper oxide reaction, the 
law of mass action becomes 

K = 4^ = 2 - = exp[— AGYR71 (12.5) 

«CU«0 2 Po 2 

where a " is replaced by unity for any component 
present in equilibrium as a pure solid or liquid. Some 
solutions do behave ideally (e.g. Mn in Fe) obeying 
Raoult’s law with a t = q. Others tend to in dilute 
solution (e.g. Fe in Cu) but others deviate widely with 
a t approximately proportional to q (Henry’s law). 

12.2.1.2 Kinetics of oxidation 
Free energy changes indicate the probable stable reac- 
tion product but make no prediction of the rate at 
which this product is formed. During oxidation the 
first oxygen molecules to be absorbed on the metal 
surface dissociate into their component atoms before 
bonding chemically to the surface atoms of the metal. 
This process, involving dissociation and ionization, is 
known as chemisorption. After the build-up of a few 
adsorbed layers the oxide is nucleated epitaxially on 
the grains of the base metal at favourable sites, such as 
dislocations and impurity atoms. Each nucleated region 
grows, impinging on one another until the oxide film 
forms over the whole surface. Oxides are therefore 
usually composed of an aggregate of individual grains 
or crystals, and exhibit phenomena such as recrystal- 
lization, grain growth, creep involving lattice defects, 
just as in a metal. 

If the oxide film initially produced is porous the 
oxygen is able to pass through and continue to react 
at the oxide -metal interface. Usually, however, the 
film is not porous and continued oxidation involves 
diffusion through the oxide layer. If oxidation takes 
place at the oxygen -oxide surface, then metal ions 
and electrons have to diffuse through from the under- 
lying metal. When the oxidation reaction occurs at 



Corrosion and surface engineering 379 


the metal -oxide interface, oxygen ions have to dif- 
fuse through the oxide and electrons migrate in the 
opposite direction to complete the reaction. 

The growth of the oxide film may be followed by 
means of a thermobalance in conjunction with met- 
allographic techniques. With the thermobalance it is 
possible to measure to a sensitivity of 10 -7 g in an 
accurately controlled atmosphere and temperature. The 
most common metallographic technique is ellipsom- 
etry, which depends on the change in the plane of 
polarization of a beam of polarized light on reflection 
from an oxide surface; the angle of rotation depends 
on the thickness of the oxide. Interferometry is also 
used, but more use is being made of replicas and thin 
films in the transmission electron microscope and the 
scanning electron microscope. 

The rate at which the oxide film thickens depends 
on the temperature and the material, as shown in 
Figure 12.2. During the initial stages of growth at low 
temperatures, because the oxygen atoms acquire elec- 
trons from the surface metal atoms, a strong electric 
field is set up across the thin oxide film pulling the 
metal atoms through the oxide. In this low- temperature 
range (e.g. Fe below 200°C) the thickness increases 
logarithmically with time (. x oc In t), the rate of oxida- 
tion falling off as the field strength diminishes. 

At intermediate temperatures (e.g. 250-1000°C in 
Fe) the oxidation develops with time according to a 
parabolic law (. x 2 oc t ) in nearly all metals. In this 
region the growth is a thermally-activated process and 
ions pass through the oxide film by thermal move- 
ment, their speed of migration depending on the nature 
of the defect structure of the oxide lattice. Large 
stresses, either compressive or tensile, may often build 
up in oxide films and lead to breakaway effects when 
the protective oxide film cracks and spalls. Repeated 
breakaway on a fine scale can prevent the develop- 
ment of extensive parabolic growth and the oxida- 
tion assumes an approximately linear rate or even 
faster. The stresses in oxide film are related to the 
Pilling -Bed worth (P-B) ratio, defined as the ratio 
of the molecular volume of the oxide to the atomic 
volume of the metal from which the oxide is formed. 



Figure 12.2 Different forms of oxidation behaviour in 
metals. 


If the ratio is less than unity as for Mg, Na, K, the 
oxide formed may be unable to give adequate pro- 
tection against further oxidation right from the initial 
stages and under these conditions, commonly found 
in alkali metals, linear oxidation (x oc t) is obeyed. 
If, however, the P-B ratio is very much greater than 
unity, as for many of the transition metals, the oxide 
is too bulky and may also tend to spall. 

At high temperatures oxide films thicken according 
to the parabolic rate law, x 2 oc t and the mechanism 
by which thickening proceeds has been explained by 
Wagner. Point defects (see Chapter 4) diffuse through 
the oxide under the influence of a constant concen- 
tration gradient. The defects are annihilated at one of 
the interfaces causing a new lattice site to be formed. 
Specifically, zinc oxide thickens by the diffusion of 
zinc interstitials created at the metal/oxide interface 
through the oxide to the oxide/oxygen interface where 
they are removed by the reaction 

2Zn* ++ + 4e + 0 2 — »> 2ZnO 

The concentration of zinc interstitials at the metal/oxide 
interface is maintained by the reaction 

Zn( meta i) > Zn, T 2e 

with the creation of vacancies in the zinc lattice. The 
migration of charged interstitial defects is accompanied 
by the migration of electrons and for thick oxide films 
it is reasonable to assume that the concentrations of the 
two migrating species are constant at the two surfaces 
of the oxide, i.e. oxide/gas and oxide/metal, governed 
by local thermodynamic equilibria. There is thus a 
constant concentration difference Ac across the oxide 
and the rate of transport through unit area will be 
DAc/x , where D is a diffusion coefficient and x the 
film thickness. The rate of growth is then 

cbc/d t oc D{Ac/x) 

and the film thickens parabolically according to the 
relation 

x 2 = k t (12.6) 

where k is a constant involving several structural 
parameters. Wagner has shown that the oxidation pro- 
cess can be equated to an ionic plus an electronic 
current, and obtained a rate equation for oxidation in 
chemical equivalents cm -2 s -1 involving the transport 
numbers of anions and electrons, respectively, the con- 
ductivity of the oxide, the chemical potentials of the 
diffusing ions at the interfaces and the thickness of 
the oxide film. Many oxides thicken according to a 
parabolic law over some particular temperature range. 
It is a thermally-activated process and the rate constant 
k = ko x exp [-Q/RT] with Q equal to the activation 
energy for the rate-controlling diffusion process. 

At low temperatures and for thin oxide films, a 
logarithmic rate law is observed. To account for this 
the Wagner mechanism was modified by Cabrera 
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and Mott. The Wagner mechanism is only applicable 
when the concentrations of point defects and electrons 
are equal throughout the film; for thin oxide films 
this is not so, a charged layer is established at the 
oxide/oxygen interface. Here the oxygen atoms on the 
outer surface become negative ions by extracting elec- 
trons from the metal underneath the film and so exert 
an electrostatic attraction on the positive ions in the 
metal. When the oxide thickness is =€10 nm this layer 
results in an extremely large electric field being set 
up which pulls the diffusing ions through the film and 
accelerates the oxidation process. As the film thickens 
the field strength decreases as the distance between 
positive and negative ions increases and the oxida- 
tion rate approximates to that predicted by the Wagner 
theory. 

As the scale thickens, according to a parabolic law, 
the resultant stress at the interface increases and even- 
tually the oxide layer can fail either by fracture parallel 
to the interface or by a shear or tensile fracture through 
the layer. In these regions the oxidation rate is then 
increased until the build-up in stress is again relieved 
by local fracture of the oxide scale. Unless the scale 
fracture process occurs at the same time over the whole 
surface of the specimen then the repeated parabolic 
nature of the oxidation rate will be smoothed out and 
an approximately linear law observed. This break- 
away parabolic law is sometimes called paralinear, 
and is common in oxidation of titanium when the 
oxide reaches a critical thickness. In some metals, how- 
ever, such as U, W and Ce, the linear rate process is 
associated with an interface reaction converting a thin 
protective inner oxide layer to a non-protective porous 
oxide. 

12.2.1.3 Parameters affecting oxidation rates 
The Wagner theory of oxidation and its dependence 
on the nature of the defect structure has been suc- 
cessful in explaining the behaviour of oxides under 
various conditions, notably the effects of small alloy- 
ing additions and oxygen pressure variations. The 
observed effects can be explained by reference to 
typical n- and /7-type semiconducting oxides. For 
oxidation of Zn to ZnO the zinc atoms enter the 
oxide interstitially at the oxide/metal interface (Zn -> 
Zn* ++ + 2e), and diffuse to the oxide/oxygen interface. 
The oxide/oxygen interface reaction (2Zn ; - ++ + 4e + 
O 2 -> 2ZnO) is assumed to a rapid (equilibrium) pro- 
cess and consequently, the concentration of defects at 
this interface is very small, and independent of oxygen 
pressure. This is found to be the case experimentally 
for oxide thicknesses in the Wagner region. By con- 
sidering the oxide as a semiconductor with a relatively 
small defect concentration, the law of mass action can 
be applied to the defect species. For the oxide/oxygen 
interface reaction this means that 

[Zn ++ ] 2 [e] 4 = constant 

The effect of small alloying additions can be explained 
(Wagner- Hauffe rule) by considering this equation. 


Suppose an alloying element is added to the metal 
that enters the oxide on the cation lattice. Since there 
are associated with each cation site only two electron 
sites available in the valence band of the oxide, if the 
element is trivalent the excess electrons enter the con- 
duction band, increasing the concentration of electrons. 
For a dilute solution the equilibrium constant remains 
unaffected and hence, from the above equation, the net 
effect of adding the element will be to decrease the 
concentration of zinc interstitials and thus the rate of 
oxidation. Conversely, addition of a monovalent ele- 
ment will increase the oxidation rate. Experimentally 
it is found that A1 decreases and Li increases the oxi- 
dation rate. 

For Cu 2 0, a /7-type semiconductor, the oxide forma- 
tion and cation vacancy (Cu + D) creation take place at 
the oxide/oxygen interface, according to 

0 2 + 4Cu = 2 Cu 2 0 + 4(Cu + D) + 4(eD) 

The defect diffuses across the oxide and is eliminated 
at the oxide-metal interface; the equilibrium concen- 
tration of defects is at the metal -oxide interface and 
the excess at the oxide -oxygen interface. It follows 
therefore that the reaction rate is pressure-dependent. 
Applying the law of mass action to the oxidation reac- 
tion gives 

[Cu + D] 4 [eD] 4 = const P„ 2 

and, since electrical neutrality requires [Cu + D] = 
[eD], then 

[Cu+D] = const P 02 8 (12.7) 

and the reaction rate should be proportional to the 
l/8th power of the oxygen pressure. In practice, it 
varies as Pq 2 , and the discrepancy is thought to be 
due to the defect concentration not being sufficiently 
low to neglect any interaction effects. The addition of 
lower valency cations (e.g. transition metals) would 
contribute fewer electrons and thereby increase the 
concentration of holes, decreasing the vacancy concen- 
tration and hence the rate. Conversely, higher valency 
cations increase the rate of oxidation. 

12.2.1.4 Oxidation resistance 
The addition of alloying elements according to the 
Wagner-Hauffe rule just considered is one way in 
which the oxidation rate may be changed to give 
increased oxidation resistance. The alloying element 
may be added, however, because it is a strong oxide- 
former and forms its own oxide on the metal surface 
in preference to that of the solvent metal. Chromium, 
for example, is an excellent additive, forming a protec- 
tive Cr 2 03 layer on a number of metals (e.g. Fe, Ni) 
but is detrimental to Ti which forms an n-type anion- 
defective oxide. Aluminium additions to copper simi- 
larly improve the oxidation behaviour by preferential 
oxidation to A1 2 0 3 . In some cases, the oxide formed 



Corrosion and surface engineering 381 


is a compound oxide of both the solute and solvent 
metals. The best-known examples are the spinels with 
cubic structure (e.g. Ni0.Cr 2 0 3 and Fe0.Cr 2 0 3 ). It 
is probable that the spinel formation is temperature- 
dependent, with Cr 2 0 3 forming at low temperatures 
and the spinel at higher ones. 

Stainless steels (ferritic, austenitic or martensitic) 
are among the best oxidation-resistant alloys and are 
based on Fe-Cr. When iron is heated above about 
570° C the oxide scale which forms consists of wiistite, 
FeO (a /7-type semiconductor) next to the metal, 
magnetite Fe 3 C >4 (a /7-type semiconductor) next and 
haematite Fe 2 0 3 (an ft -type semiconductor) on the out- 
side. When Cr is added at low concentrations the Cr 
forms a spinel FeO.Cr0 3 with the wiistite and later 
with the other two oxides. However, a minimum Cr 
addition of 12% is required before the inner layer is 
replaced by Cr 2 0 3 below a thin outer layer of Fe 2 0 3 . 
Heat-resistant steels for service at temperatures above 
1000°C usually contain 18% Cr or more, and austenitic 
stainless steels 18% Cr, 8% Ni. The growth of Cr 2 0 3 
on austenitic stainless steels containing up to 20% Cr 
appears to be rate-controlled by chromium diffusion. 
Kinetic factors determine whether Cr 2 0 3 or a duplex 
spinel oxide form, the nucleation of Cr 2 0 3 is favoured 
by higher Cr levels, higher temperatures and by sur- 
face treatments (e.g. deformation), which increase the 
diffusivity. Surface treatments which deplete the sur- 
face of Cr promote the formation of spinel oxide. Once 
Cr 2 0 3 is formed, if this film is removed or disrupted, 

(a) (b) 



/mm? 


(c) 



then spinel oxidation is favoured because of the local 
lowering of Cr. 

When chromium-bearing alloys, such as austenitic 
stainless steels, are exposed to the hot combustion 
products of fossil fuels, the outer layer of 
chromium oxide which forms is often associated 
with an underlying sulphide phase (Figure 12.3a). 
This duplex structure can be explained by using 
phase (stability) diagrams and the concept of 
‘reaction paths’. Previously, in Section 3.2. 8. 5, it 
was indicated that a two-dimensional section could 
be taken through the full three-dimensional diagram 
for a metal -sulphur -oxygen system (Figure 3.23). 
Accordingly, in a similar way, we can extract an 
isothermal section from the full phase diagram for 
the Cr-S-0 system, as shown in Figure 12.3b. The 
chemical activities of sulphur and oxygen in the 
gas phase are functions of their partial pressures 
(concentration). If the partial pressure of sulphur is 
relatively low, the composition of the gas phase will 
lie within the chromium oxide field and the alloy will 
oxidize (Figure 12.3b). Sulphur and oxygen diffuse 
through the growing layer of oxide scale but S 2 diffuses 
faster than 0 2 , accordingly, the composition of the gas 
phase in contact with the alloy follows a ‘reaction 
path’, as depicted by the dashed line. Figure 12.3c 
shows the reaction path for gases with a higher initial 
partial pressure of sulphur. Its slope is such that first 
chromium oxide forms, and then chromium sulphide 
(i.e. Cr + S = CrS). Sometimes the oxide scale may 
crack or form voids. In such cases, the activity of S 2 




Figure 12.3 Reaction paths for oxidation and sulphidation of chromium. 



382 Modern Physical Metallurgy and Materials Engineering 


may rise locally within the scale and far exceed that of 
the main gas phase. Sulphidation of the chromium then 
becomes likely, despite a low concentration of sulphur 
in the main gas stream (Figure 12.3d). 

Relative tendencies of different metallic elements to 
oxidize and/or sulphidize at a given temperature may 
be gauged by superimposing their isothermal ps 2 —Po 2 
diagrams, as in Figure 12.4. For example, with the 
heat-resistant 80Ni-20Cr alloy ( Nichrome ), it can be 
reasoned that (1) Cr 2 0 3 scale and CrS subscale are 
both stable in the presence of nickel, and (2) Cr 2 0 3 
forms in preference to NiO; that is, at much lower 
partial pressures of oxygen. The physical state of a 
condensed phase is extremely important because liq- 
uid phases favour rapid diffusion and thus promote 
corrosive reactions. Although nickel has a higher sul- 
phidation threshold than chromium, the Ni-NiS eutec- 
tic reaction is of particular concern with Ni-containing 
alloys because it takes place at the relatively low tem- 
perature of 645 °C. 

12.2.2 Aqueous corrosion 

12.2.2.1 Electrochemistry of corrosion 
Metals corrode in aqueous environments by an electro- 
chemical mechanism involving the dissolution of the 
metal as ions (e.g. Fe -> Fe 2+ + 2e). The excess elec- 
trons generated in the electrolyte either reduce hydro- 
gen ions (particularly in acid solutions) according to 

2H+ + 2e -> H 2 

so that gas is evolved from the metal, or create 
hydroxyl ions by the reduction of dissolved oxygen 
according to 

0 2 + 4e + 2H 2 0 -> 40H - 

The corrosion rate is therefore associated with the 
flow of electrons or an electrical current. The two 
reactions involving oxidation (in which the metal ion- 
izes) and reduction occur at anodic and cathodic sites, 
respectively, on the metal surface. Generally, the metal 



Figure 12.4 Superimposition of isothermal sections from 
Cr-S-0 and Ni-S-0 systems. 


surface consists of both anodic and cathodic sites, 
depending on segregation, microstructure, stress, etc., 
but if the metal is partially-immersed there is often 
a distinct separation of the anodic and cathodic areas 
with the latter near the waterline where oxygen is read- 
ily dissolved (differential aeration). Figure 12.5 illus- 
trates the formation of such a differential aeration cell; 
Fe 2+ ions pass into solution from the anode and OH - 
ions from the cathode, and where they meet they form 
ferrous hydroxide Fe(OH) 2 . However, depending on 
the aeration, this may oxidize to Fe(OH) 3 , red-rust 
Fe 2 0 3 .H 2 0, or black magnetite Fe 3 0 4 . Such a pro- 
cess is important when water, particularly seawater, 
collects in crevices formed by service, manufacture or 
design. In this form of corrosion the rate-controlling 
process is usually the supply of oxygen to the cathodic 
areas and, if the cathodic area is large, can often lead 
to intense local attack of small anode areas, such as 
pits, scratches, crevices, etc. 

In the absence of differential aeration, the formation 
of anodic and cathodic areas depends on the ability to 
ionize. Some metals ionize easily, others with diffi- 
culty and consequently anodic and cathodic areas may 
be produced, for example, by segregation, or the join- 
ing of dissimilar metals. When any metal is immersed 
in an aqueous solution containing its own ions, positive 
ions go into solution until the resulting electromotive 
force (emf) is sufficient to prevent any further solu- 
tion; this emf is the electrode potential or half-cell 
potential. To measure this emf it is necessary to use 
a second reference electrode in the solution, usually 
a standard hydrogen electrode. With no current flow- 
ing, the applied potential cancels out the extra potential 
developed by the spontaneous ionization at the metal 
electrode over and above that at the standard hydro- 
gen electrode. With different metal electrodes a table 
of potentials (E 0 ) can be produced for the half-cell 
reactions 

M M n+ + ne (12.8) 

where E 0 is positive. The usual convention is to write 
the half-cell reaction in the reverse direction so that 
the sign of E 0 is also reversed, i.e. E 0 is negative; E 0 
is referred to as the standard electrode potential. 

It is common practice to express the tendency of a 
metal to ionize in terms of this voltage, or potential, 
Eq, rather than free energy, where AG = —nFE 0 for 
the half-cell reaction with nF coulomb of electrical 
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Figure 12.5 Corrosion of iron by differential aeration. 
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Table 12.1 Electrochemical Series 


Electrode reaction 

Standard 
electrode 
potential Eq (V) 

Cs = Cs+ + e ~ -3.02 

Li = Li+ + e g -3.02 

* 

II 

* 

+ 

+ 

CD 

0 -2.92 

Na = Na++e -2.71 

Ca = Ca 2+ + 2e § -2.50 

Mg = Mg 2 + + 2e -2.34 

A1 = Al 3+ + 3e 

k -1.07 

Ti = Ti 2+ + 2e 

-1.67 

Zn = Zn 2 + + 2e 

-0.76 

Cr = Cr 3+ + 3e 

-0.50 

Fe = Fe 2+ + 2e 

-0.44 

Cd = Cd 2+ + 2e 

-0.40 

Ni = Ni 2+ + 2e 

-0.25 

Sn = Sn 2+ + 2e 

-0.136 

Pb = Pb 2+ + 2e 

-0.126 

H = 2H+ + 2e 

0.00 


Cu = Cu 2+ + 2e 
Hg = Hg 2 + + 2e | 
Ag = Ag+ + e * 

Pt = Pt 2+ + 2e 3 
Au = Au + + e Z 


+0.34 
+0.80 
+0.80 
+ 1.20 
+ 1.68 


charge transported per mole. The half-cell potentials 
are given in Table 12.1 for various metals, and refer to 
the potential developed in a standard ion concentration 
of one mole of ions per litre (i.e. unit activity), relative 
to a standard hydrogen electrode at 25° C which is 
assigned a zero voltage. The voltage developed in 
any galvanic couple (i.e. two half cells) is given 
by the difference of the electrode potentials. If the 
activity of the solution is increased then the potential 


increases according to the Nernst equation E = E 0 + 
(RT/nF)\na. 

The easily ionizable ‘reactive’ metals have large 
negative potentials and dissolve even in concentrated 
solutions of their own ions, whereas the noble metals 
have positive potentials and are deposited from solu- 
tion. These differences show that the valency electrons 
are strongly bound to the positive core in the noble 
metals because of the short distance of interaction, i.e. 
^atomic — ^ ionic* A metal will therefore displace from 
solution the ions of a metal more noble than itself in 
the Series. When two dissimilar metals are connected 
in neutral solution to form a cell, the more metallic 
metal becomes the anode and the metal with the lower 
tendency to ionize becomes the cathode. The Electro- 
chemical Series indicates which metal will corrode in 
the cell but gives no information on the rate of reac- 
tions. When an anode M corrodes, its ions enter into 
the solution initially low in M + ions, but as current 
flows the concentration of ions increases. This leads 
to a change in electrode potential known as polariza- 
tion, as shown in Figure 12.6a, and corresponds to a 
reduced tendency to ionize. The current density in the 
cell is a maximum when the anode and cathode poten- 
tial curves intersect. Such a condition would exist if 
the two metals were joined together or anode and cath- 
ode regions existed on the same metal, i.e. differential 
aeration. This potential is referred to as the corrosion 
potential and the current, the corrosion current. 

In many reactions, particularly in acid solutions, 
hydrogen gas is given off at the cathode rather than 
the anode metal deposited. In practice, the evolution 
of hydrogen gas at the cathode requires a smaller 
additional overvoltage, the magnitude of which varies 
considerably from one cathode metal to another, and 
is high for Pb, Sn and Zn and low for Ag, Cu, Fe 
and Ni; this overvoltage is clearly of importance in 
electrodeposition of metals. In corrosion, the overvolt- 
age arising from the activation energy opposing the 
electrode reaction decreases the potential of the cell, 




Figure 12.6 Schematic representation of (a) cathode and anode polarization curves and (b) influence of oxygen concentration 
on cathode polarization. 
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i.e. hydrogen atoms effectively shield or polarize the 
cathode. The degree of polarization is a function of 
current density and the potential E to drive the reaction 
decreases because of the increased rate of H 2 evolu- 
tion, as shown in Figure 12.7 for the corrosion of zinc 
and iron in acid solutions. Corrosion can develop up 
to a rate given by the current when the potential differ- 
ence required to drive the reaction is zero; for zinc this 
is z'zn and for iron / Fe . Because of its large overvoltage 
zinc is corroded more slowly than iron, even though 
there is a larger difference between zinc and hydrogen 
than iron and hydrogen in the Electrochemical Series. 
The presence of Pt in the acid solution, because of 
its low overvoltage, increases the corrosion rate as it 
plates out on the cathode metal surface. In neutral or 
alkaline solutions, depolarization is brought about by 
supplying oxygen to the cathode area which reacts with 
the hydrogen ions as shown in Figure 12.6b. In the 
absence of oxygen both anodic and cathodic reactions 
experience polarization and corrosion finally stops; it 
is well-known that iron does not rust in oxygen-free 
water. 

It is apparent that the cell potential depends on the 
electrode material, the ion concentration of the elec- 
trolyte, passivity and polarization effects. Thus it is not 
always possible to predict the precise electrochemical 
behaviour merely from the Electrochemical Series (i.e. 
which metal will be anode or cathode) and the mag- 
nitude of the cell voltage. Therefore it is necessary to 
determine the specific behaviour of different metals in 
solutions of different acidity. The results are displayed 
usually in Pourbaix diagrams as shown in Figure 12.8. 
With stainless steel, for example, the anodic polariza- 
tion curve is not straightforward as discussed previ- 
ously, but takes the form shown in Figure 12.9, where 
the low-current region corresponds to the condition of 
passivity. The corrosion rate depends on the position 
at which the cathode polarization curve for hydrogen 
evolution crosses this anode curve, and can be quite 
high if it crosses outside the passive region. Pourbaix 
diagrams map out the regions of passivity for solu- 
tions of different acidity. Figure 12.8 shows that the 
passive region is restricted to certain conditions of 



Figure 12.7 Corrosion of zinc and iron and the effect of 
polarization. 


pH; for Ti this is quite extensive but Ni is passive 
only in very acid solutions and A1 in neutral solu- 
tions. Interestingly, these diagrams indicate that for Ti 
and Ni in contact with each other in corrosive con- 
ditions then Ni would corrode, and that passivity has 
changed their order in the Electrochemical Series. In 
general, passivity is maintained by conditions of high 
oxygen concentration but is destroyed by the presence 
of certain ions such as chlorides. 

The corrosion behaviour of metals and alloys can 
therefore be predicted with certainty only by obtaining 
experimental data under simulated service conditions. 
For practical purposes, the cell potentials of many 
materials have been obtained in a single environment, 
the most common being sea water. Such data in tabular 
form are called a Galvanic Series, as illustrated in 
Table 12.2. If a pair of metals from this Series were 
connected together in sea water, the metal which is 
higher in the Series would be the anode and corrode, 
and the further they are apart, the greater the corrosion 
tendency. Similar data exist for other environments. 

12.2.2.2 Protection against corrosion 
The principles of corrosion outlined above indicate 
several possible methods of controlling corrosion. 
Since current must pass for corrosion to proceed, any 
factor, such as cathodic polarization which reduces the 
current, will reduce the corrosion rate. Metals having 
a high overvoltage should be utilized where possi- 
ble. In neutral and alkaline solution de-aeration of the 
electrolyte to remove oxygen is beneficial in reducing 
corrosion (e.g. heating the solution or holding under 
a reduced pressure preferably of an inert gas). It is 
sometimes possible to reduce both cathode and anode 
reactions by ‘artificial’ polarization (for example, by 
adding inhibitors which stifle the electrode reaction). 
Calcium bicarbonate, naturally present in hard water, 
deposits calcium carbonate on metal cathodes and sti- 
fles the reaction. Soluble salts of magnesium and zinc 
act similarly by precipitating hydroxide in neutral solu- 
tions. 

Anodic inhibitors for ferrous materials include pota- 
ssium chromate and sodium phosphate, which convert 
the Fe 2+ ions to insoluble precipitates stifling the 
anodic reaction. This form of protection has no effect 
on the cathodic reaction and hence if the inhibitor fails 
to seal off the anode completely, intensive local attack 
occurs, leading to pitting. Moreover, the small current 
density at the cathode leads to a low rate of polarization 
and the attack is maintained. Sodium benzoate is often 
used as an anodic inhibitor in water radiators because 
of its good sealing qualities, with little tendency for 
pitting. 

Some metals are naturally protected by their adher- 
ent oxide films; metal oxides are poor electrical con- 
ductors and so insulate the metal from solution. For 
the reaction to proceed, metal atoms have to diffuse 
through the oxide to the metal -liquid interface and 
electrons back through the high-resistance oxide. The 
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Figure 12.8 Pourbaix diagrams for (a) Ti, (b) Fe, (c) Ni, (d) Al. The clear regions are passive, the heavily -shaded regions 
corroding and the lightly -shaded regions immune. The sloping lines represent the upper and lower boundary conditions in 
service. 



Figure 12.9 Anode polarization curve for stainless steel. 


Table 12.2 Galvanic Series in sea water 


Anodic or most reactive 
Mg and its alloys 
Zn 

Galvanized steel 
Al 

Mild steel 
Cast iron 

Stainless steel (active) 

Pb 

Sn 

Brass 


’> 


Cu 

Ni (active) 

Inconel (active) 

Ag 

Ni (passive) 

Inconel (passive) 

Monel 

Ti 

Stainless steel (passive) 
Cathodic or most noble 


corrosion current is very much reduced by the forma- 
tion of such protective or passive oxide films. Al is 
cathodic to zinc in sea water even though the Electro- 
chemical Series shows it to be more active. Materials 
which are passivated in this way are chromium, stain- 
less steels, Inconel and nickel in oxidizing conditions. 
Reducing environments (e.g. stainless steels in HC1) 
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destroy the passive film and render the materials active 
to corrosion attack. Certain materials may be artifi- 
cially passivated by painting. The main pigments used 
are red lead, zinc oxide and chromate, usually sus- 
pended in linseed oil and thinned with white spirit. 
Slightly soluble chromates in the paint passivate the 
underlying metal when water is present. Red lead 
reacts with the linseed oil to form lead salts of various 
fatty acids which are good anodic inhibitors. 

Sacrificial or cathodic protection is widely used. A 
typical example is galvanized steel sheet when the 
steel is protected by sacrificial corrosion of the zinc 
coating. Any regions of steel exposed by small flaws 
in the coating polarize rapidly since they are cathodic 
and small in area; corrosion products also tend to plug 
the holes in the Zn layer. Cathodic protection is also 
used for ships and steel pipelines buried underground. 
Auxiliary sacrificial anodes are placed at frequent 
intervals in the corrosive medium in contact with the 
ship’s hull or pipe. Protection may also be achieved by 
impressing a d.c. voltage to make it a cathode, with 
the negative terminal of the d.c. source connected to a 
sacrificial anode. 

12.2.2.3 Corrosion failures 

In service, there are many types of corrosive attack 
which lead to rapid failure of components. A familiar 
example is intergranular corrosion and is associated 
with the tendency for grain boundaries to undergo 
localized anodic attack. Some materials are, however, 
particularly sensitive. The common example of this 
sensitization occurs in 18Cr-8Ni stainless steel, which 
is normally protected by a passivating Cr 2 0 3 film after 
heating to 500- 8 00° C and slowly cooling. During 
cooling, chromium near the grain boundaries precip- 
itates as chromium carbide. As a consequence, these 
regions are depleted in Cr to levels below 12% and are 
no longer protected by the passive oxide film. They 
become anodic relative to the interior of the grain and, 
being narrow, are strongly attacked by the corrosion 
current generated by the cathode reactions elsewhere. 
Sensitization may be avoided by rapid cooling, but in 
large structures that is not possible, particularly after 
welding, when the phenomenon (called weld decay) is 
common. The effect is then overcome by stabilizing 
the stainless steel by the addition of a small amount 
(0.5%) of a strong carbide-former such as Nb or Ti 
which associates with the carbon in preference to the 
Cr. Other forms of corrosion failure require the compo- 
nent to be stressed, either directly or by residual stress. 
Common examples include stress-corrosion cracking 
(SCC) and corrosion-fatigue. Hydrogen embrittlement 
is sometimes included in this category but this type of 
failure has somewhat different characteristics and has 
been considered previously. These failures have certain 
features in common. SCC occurs in chemically active 
environments; susceptible alloys develop deep fissures 
along active slip planes, particularly alloys with low 
stacking-fault energy with wide dislocations and pla- 
nar stacking faults, or along grain boundaries. For such 


selective chemical action the free energy of reaction 
can provide almost all the surface energy for fracture, 
which may then spread under extremely low stresses. 

Stress corrosion cracking was first observed in 
a-brass cartridge cases stored in ammoniacal envi- 
ronments. The phenomenon, called season-cracking 
since it occurred more frequently during the mon- 
soon season in the tropics, was prevented by giving 
the cold-worked brass cases a mild annealing treat- 
ment to relieve the residual stresses of cold forming. 
The phenomenon has since extended to many alloys in 
different environments (e.g. Al-Cu, Al-Mg, Ti-Al), 
magnesium alloys, stainless steels in the presence of 
chloride ions, mild steels with hydroxyl ions (caustic 
embrittlement) and copper alloys with ammonia ions. 

Stress corrosion cracking can be either transgran- 
ular or intergranular. There appears to be no unique 
mechanism of transgranular stress corrosion cracking, 
since no single factor is common to all susceptible 
alloys. In general, however, all susceptible alloys are 
unstable in the environment concerned but are largely 
protected by a surface film that is locally destroyed 
in some way. The variations on the basic mechanism 
arise from the different ways in which local activity 
is generated. Breakdown in passivity may occur as a 
result of the emergence of dislocation pile-ups, stack- 
ing faults, micro-cracks, precipitates (such as hydrides 
in Ti alloys) at the surface of the specimen, so that 
highly localized anodic attack then takes place. The 
gradual opening of the resultant crack occurs by plas- 
tic yielding at the tip and as the liquid is sucked in 
also prevents any tendency to polarize. 

Many alloys exhibit coarse slip and have similar dis- 
location substructures (e.g. co-planar arrays of disloca- 
tions or wide planar stacking faults) but are not equally 
susceptible to stress-corrosion. The observation has 
been attributed to the time necessary to repassivate 
an active area. Additions of Cr and Si to susceptible 
austenitic steels, for example, do not significantly alter 
the dislocation distribution but are found to decrease 
the susceptibility to cracking, probably by lowering the 
repassivation time. 

The susceptibility to transgranular stress corrosion 
of austenitic steels, a-brasses, titanium alloys, etc. 
which exhibit co-planar arrays of dislocations and 
stacking faults may be reduced by raising the stacking- 
fault energy by altering the alloy composition. Cross- 
slip is then made easier and deformation gives rise to 
fine slip, so that the narrower, fresh surfaces created 
have a less severe effect. The addition of elements to 
promote passivation or, more importantly, the speed of 
repassivation should also prove beneficial. 

Intergranular cracking appears to be associated with 
a narrow soft zone near the grain boundaries. In a- 
brass this zone may be produced by local dezincifica- 
tion. In high-strength Al-alloys there is no doubt that it 
is associated with the grain boundary precipitate-free 
zones (i.e. PFZs). In such areas the strain-rate may be 
so rapid, because the strain is localized, that repassiva- 
tion cannot occur. Cracking then proceeds even though 
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the slip steps developed are narrow, the crack dis- 
solving anodically as discussed for sensitized stainless 
steel. In practice there are many examples of intergran- 
ular cracking, including cases (1) that depend strongly 
on stress (e.g. Al-alloys), (2) where stress has a com- 
paratively minor role (e.g. steel cracking in nitrate 
solutions) and (3) which occur in the absence of stress 
(e.g. sensitized 18Cr-8Ni steels); the last case is the 
extreme example of failure to repassivate for purely 
electrochemical reasons. In some materials the crack 
propagates, as in ductile failure, by internal necking 
between inclusions which occurs by a combination of 
stress and dissolution processes. The stress sensitivity 
depends on the particle distribution and is quite high 
for fine- scale and low for coarse- scale distributions. 
The change in precipitate distribution in grain bound- 
aries produced, for example, by duplex ageing can thus 
change the stress-dependence of intergranular failure. 

In conditions where the environment plays a role, 
the crack growth rate varies with stress intensity K 
in the manner shown in Figure 12.10. In region I the 
crack velocity shows a marked dependence on stress, 
in region II the velocity is independent of the stress 
intensity and in region III the rate becomes very fast 
as K\c is approached. Ahsc is extensively quoted as 
the threshold stress intensity below which the crack 
growth rate is negligible (e.g. <10 10 m s ! ) but, like 
the endurance limit in fatigue, does not exist for all 
materials. In region I the rate of crack growth is 
controlled by the rate at which the metal dissolves 
and the time for which the metal surface is exposed. 
While anodic dissolution takes place on the exposed 
metal at the crack tip, cathodic reactions occur at the 
oxide film on the crack sides leading to the evolution of 
hydrogen which diffuses to the region of triaxial tensile 
stress and hydrogen-induced cracking. At higher stress 
intensities (region II) the strain-rate is higher, and 
then other processes become rate-controlling, such as 



Figure 12.10 Variation of crack growth rate with stress 
intensity during corrosion. 


diffusion of new reactants into the crack tip region. 
In hydrogen embrittlement this is probably the rate of 
hydrogen diffusion. 

The influence of a corrosive environment, even 
mildly oxidizing, in reducing the fatigue life has been 
briefly mentioned in Chapter 7. The S-N curve shows 
no tendency to level out, but falls to low S- values. The 
damage ratio (i.e. corrosion fatigue strength divided by 
the normal fatigue strength) in salt water environments 
is only about 0.5 for stainless steels and 0.2 for mild 
steel. The formation of intrustions and extrusions gives 
rise to fresh surface steps which form very active 
anodic sites in aqueous environments, analogous to 
the situation at the tip of a stress corrosion crack. 
This form of fatigue is influenced by those factors 
affecting normal fatigue but, in addition, involves 
electro-chemical factors. It is normally reduced by 
plating, cladding and painting but difficulties may arise 
in localizing the attack to a small number of sites, since 
the surface is continually being deformed. Anodic 
inhibitors may also reduce the corrosion fatigue but 
their use is more limited than in the absence of fatigue 
because of the probability of incomplete inhibition 
leading to increased corrosion. 

Fretting corrosion, caused by two surfaces rubbing 
together, is associated with fatigue failure. The oxi- 
dation and corrosion product is continually removed, 
so that the problem must be tackled by improving the 
mechanical linkage of moving parts and by the effec- 
tive use of lubricants. 

With corrosion fatigue, the fracture mechanics 
threshold AAT t h is reduced and the rate of crack 
propagation is usually increased by a factor of two 
or so. Much larger increases in crack growth rate are 
produced, however, in low-frequency cycling when 
stress-corrosion fatigue effects become important. 

12.3 Surface engineering 

12.3.1 The coating and modification of 
surfaces 

The action of the new methods for coating or modi- 
fying material surfaces, such as vapour deposition and 
beam bombardment, can be highly specific and energy- 
efficient. They allow great flexibility in controlling the 
chemical composition and physical structure of sur- 
faces and many materials which resisted conventional 
treatments can now be processed. Grain size and the 
degree of crystalline perfection can be varied over a 
wide range and beneficial changes in properties pro- 
duced. The new techniques often eliminate the need 
for the random diffusion of atoms so that tempera- 
tures can be relatively low and processing times short. 
Scientifically, they are intriguing because their nature 
makes it possible to bypass thermodynamic restrictions 
on alloying and to form unorthodox solid solutions and 
new types of metastable phase. 
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Table 12.3 Methods of coating and modifying surfaces (after R. F. Bunshah, 1984; by permission of Marcel Dekker) 


Atomistic deposition 

Particulate deposition 

Bulk coatings 

Surface modification 

Electrolytic environment 

Thermal spraying 

Wetting processes 

Chemical conversion 

Electroplating 

Plasma- spraying 

Painting 

Electrolytic 

Electroless plating 

Detonation-gun 

Dip coating 

Anodizing (oxide) 

Fused salt electrolysis 

Flame-spraying 

Electrostatic spraying 

Fused salts 

Chemical displacement 

Fusion coatings 

Printing 

Chemical-liquid 

Vacuum environment 

Thick film ink 

Spin coating 

Chemical-vapour 

Vacuum evaporation 

Enamelling 

Cladding 

Thermal 

Ion beam deposition 

Electrophoretic 

Explosive 

Plasma 

Molecular beam epitaxy 

Plasma environment 

Sputter deposition 

Activated reactive evaporation 
Plasma polymerization 

Ion plating 

Chemical vapour environment 
Chemical vapour deposition 
Reduction 

Decomposition 

Plasma enhanced 

Spray pyrolysis 

Impact plating 

Roll bonding 
Overlaying 

Weld coating 

Liquid phase epitaxy 

Leaching 

Mechanical 

Shot-peening 

Thermal 

Surface enrichment 
Diffusion from bulk 
Sputtering 

Ion implantation 

Laser processing 


The number and diversity of methods for coating or 
modifying surfaces makes general classification diffi- 
cult. For instance, the energies required by the various 
processes extend over some five orders of magni- 
tude. Illustrating this point, sputtered atoms have a 
low thermal energy (<1 eV) whereas the energy of 
an ion beam can be >100 keV. A useful introductory 
classification of methods for coating and modifying 
material surfaces appears in Table 12.3, which takes 
some account of the different forms of mass trans- 
fer. The first column refers to coatings formed from 
atoms and ions (e.g. vapour deposition). The second 
column refers to coatings formed from liquid droplets 
or small particles. A third category refers to the direct 
application of coating material in quantity (e.g. paint). 
Finally, there are methods for the near- surface modifi- 
cation of materials by chemical, mechanical and ther- 
mal means and by bombardment (e.g. ion implantation, 
laser processing). 

Some of the methods that utilize deposition from 
a vapour phase or direct bombardment with particles, 
ions or radiation will be outlined: it will be apparent 
that each of the processes discussed has three stages: 

(1) a source provides the coating or modifying specie, 

(2) this specie is transported from source to substrate 
and (3) the specie penetrates and modifies the substrate 
or forms an overlay. Each stage is, to a great extent, 
independent of the other two stages, tending to give 
each process an individual versatility. 

12.3.2 Surface coating by vapour deposition 

12.3.2.1 Chemical vapour deposition 
In the chemical vapour deposition (CVD) process a 
coating of metal, alloy or refractory compound is 
produced by chemical reaction between vapour and a 


carrier gas at or near the heated surface of a substrate 
(Figures 12.11a and 12.11b). CVD is not a ‘line-of- 
sight’ process and can coat complex shapes uniformly, 
having good ‘throwing power’. 1 Typical CVD reac- 
tions for depositing boron nitride and titanium carbide, 
respectively, are: 

500- 1500° C 

BCl3( g ) + NH3( g ) > BN( S ) + 3HCl( g ) 

800-1 000° C 

TiCl4( g ) + CH 4 ( g) > TiC( S ) + 4HCl( g ) 

It will be noted that the substrate temperatures, 
which control the rate of deposition, are relatively high. 
Accordingly, although CVD is suitable for coating 
a refractory compound, like cobalt-bonded tungsten 
carbide, it will soften a hardened and tempered high- 
speed tool steel, making it necessary to repeat the 
high-temperature heat-treatment. In one variant of the 
process (PACVD) deposition is plasma-assisted by a 
plate located above the substrate which is charged with 
a radio-frequency bias voltage. The resulting plasma 
zone influences the structure of the coating. PACVD 
is used to produce ceramic coatings (SiC, Si 3 N 4 ) but 
the substrate temperature of 650°C (minimum) is still 
too high for heat-treated alloy steels. The maximum 
coating thickness produced economically by CVD and 
PACVD is about 100 jam. 

12.3.2.2 Physical vapour deposition 

Although there are numerous versions of the physical 
vapour deposition (PVD) process, their basic design is 

1 The term ‘throwing power’ conventionally refers to the 
ability of an electroplating solution to deposit metal 
uniformly on a cathode of irregular shape. 
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(a) Resistively heated plate 



Figure 12.11 Experimental CVD reactors (from Bunshah, 1984; by permission of Marcel Dekker). 
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Figure 12.12 (a) Evaporation-dependent and (b) sputter- dependent PVD (from Barrell and Rickerby, Aug 1989, pp. 468-73; 
by permission of the Institute of Materials). 


either evaporation- or sputter-dependent. In the former 
case, the source material is heated by high-energy 
beam (electron, ion, laser), resistance, induction, etc. 
in a vacuum chamber (Figure 12.12a). The rate of 
evaporation depends upon the vapour pressure of the 
source and the chamber pressure. Metals vaporize at 
a reasonable rate if their vapour pressure exceeds 
1 N nrr 2 and the chamber pressure is below 10 -3 N 


m 2 . The evaporant atoms travel towards the substrate 
(component), essentially following lines-of-sight. 

When sputtering is used in PVD (Figure 12.12b), 
a cathode source operates under an applied poten- 
tial of up to 5 kV (direct-current or radio-frequency) 
in an atmosphere of inert gas (Ar). The vacuum is 
‘softer’, with a chamber pressure of 1 — 10 — 2 N m~ 2 . As 
positive argon ions bombard the target, momentum is 
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transferred and the ejected target atoms form a coating 
on the substrate. The ‘throwing power’ of sputter- 
dependent PVD is good and coating thicknesses are 
uniform. The process benefits from the fact that the 
sputtering yield (Y) values for metals are fairly simi- 
lar. (Y is the average number of target atoms ejected 
from the surface per incident ion, as determined exper- 
imentally.) In contrast, with an evaporation source, for 
a given temperature, the rates of vaporization can differ 
by several orders of magnitude. 

As in CVD, the temperature of the substrate is of 
special significance. In PVD, this temperature can be 
as low as 200 - 400° C, making it possible to apply the 
method to cutting and metal-forming tools of hard- 
ened steel. A titanium nitride (TiN) coating, <5 pm 
thick, can enhance tool life considerably (e.g. twist 
drills). TiN is extremely hard (2400 HV), has a low 
coefficient of friction and a very smooth surface tex- 
ture. TiN coatings can also be applied to non-ferrous 
alloys and cobalt-bonded tungsten carbide. Experience 
with the design of a TiN-coated steel has demonstrated 
that the coating/substrate system must be considered as 
a working whole. A sound overlay of wear-resistant 
material on a tough material may fail prematurely 
if working stresses cause plastic deformation of the 
supporting substrate. For this reason, and in accor- 
dance with the newly-emerging principles of surface 
engineering, it has been recommended that steel sur- 
faces should be strengthened by nitriding before a TiN 
coating is applied by PVD. 

Two important modifications of the PVD process are 
plasma-assisted physical vapour deposition (PAPVD) 
and magnetron sputtering. In PAPVD, also known as 
‘ion plating’, deposition in a ‘soft’ vacuum is assisted 
by bombardment with ions. This effect is produced 
by applying a negative potential of 2-5 kV to the 
substrate. PAPVD is a hybrid of the evaporation- 


and sputter-dependent forms of PVD. Strong bonding 
of the PAPVD coating to the substrate requires the 
latter to be free from contamination. Accordingly, in 
a critical preliminary stage, the substrate is cleansed 
by bombardment with positive ions. The source is 
then energized and metal vapour is allowed into the 
chamber. 

In the basic magnetron- assisted version of sputter- 
dependent PVD, a magnetic field is used to form 
a dense plasma close to the target. The magnetron, 
an array of permanent magnets or electromagnets, is 
attached to the rear of the target (water-cooled) with its 
north and south poles arranged to produce a magnetic 
field at right angles to the electric field between the tar- 
get and substrate (Figure 12.13a). This magnetic field 
confines electrons close to the target surface, increases 
the rate of ionization and produces a much denser 
plasma. The improved ionization efficiency allows a 
lower chamber pressure to be used; sputtered target 
atoms then become less likely to be scattered by gas 
molecules. The net effect is to improve the rate of 
deposition at the substrate. Normally the region of 
dense plasma only extends up to about 6 cm from the 
target surface. Development of unbalanced magnetron 
systems (Figure 12.13b) has enabled the depth of the 
dense plasma zone to be extended so that the substrate 
itself is subjected to ion bombardment. These energetic 
ions modify the chemical and physical properties of the 
deposit. (In one of the various unbalanced magnetron 
configurations, a ring of strong rare-earth magnetic 
poles surrounds a weak central magnetic pole.) This 
larger plasma zone can accommodate large complex 
workpieces and rapidly forms dense, non-columnar 
coatings of metals or alloys. Target/substrate separa- 
tion distances up to 20 cm have been achieved with 
unbalanced magnetron systems. 



Figure 12.13 Comparison of plasma confinement in conventional and unbalanced magnetrons (PVD) ( from Kelly, Arnell and 
Ahmed, March 1993, pp. 161-5; by permission of the Institute of Materials). 
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Figure 12.14 Coating by detonation- gun ( from Weatherill and Gill, 1988 ; by permission of the Institute of Materials) . 


12.3.3 Surface coating by particle 
bombardment 

Since the first practical realization of gas turbine 
engines in the 1940s, the pace of engineering devel- 
opment has largely been prescribed by the availability 
of suitable high-temperature materials. Components in 
the most critical sections of the engine are exposed to 
hot products of combustion moving at high velocity. In 
addition, there are destructive agents passing through 
the engine, such as sea salt and sand. In this hostile 
environment, it is extremely difficult, if not impossible, 
to develop an alloy that combines the necessary high- 
temperature strength with corrosion resistance. Much 
effort has therefore been devoted to the search for alloy 
systems that will develop a thin self-healing ‘protec- 
tive’ oxide scale. In practice, this outer layer does not 
prevent diffusing atoms from reaching and reacting 
with the alloy substrate and it may also be subject to 
thinning by erosion. The difference in thermal expan- 
sion between the oxide (ceramic) scale and the metallic 
substrate can lead to rupture and spalling of the scale 
if the scale lacks plasticity or is weakly bonded to the 
alloy. Refractory coatings which resist wear and cor- 
rosion provide one possible answer to these problems. 

The two established thermal spray methods 1 of coat- 
ing selected here for brief description are used for 
gas turbine components. In thermal spraying, pow- 
ders are injected into very hot gases and projected at 
very high velocities onto the component (substrate) 
surface. On impact, the particles plastically deform 
and adhere strongly to the substrate and each other. 
The structure, which often has a characteristic lentic- 
ular appearance in cross-section, typically comprises 
refractory constituents, such as carbide, oxide and/or 
aluminide, and a binding alloy phase. Many types of 
thermally sprayed coatings can operate at temperatures 
>1000°C. They range in thickness from microns to 
millimetres, as required. 

In the detonation-gun method (Figure 12.14) a mix- 
ture of metered quantities of oxygen and acetylene 

x The Union Carbide Corporation has been granted patent 
rights for the D-Gun and plasma- spraying methods. 


(C 2 H 2 ) is spark-ignited and detonated. Powder of 
average diameter 45 mm is injected, heated by the 
hot gases and projected from the 1 m long barrel 
of the gun onto the cooled workpiece at a velocity 
of roughly 750 m s _1 . Between detonations, which 
occur four to eight times per second, the barrel is 
purged with nitrogen. Typical applications, and coat- 
ing compositions, for wear-resistant D-Gun coatings 
are bearing sealing surfaces (WC-9Co), compressor 
blades (WC-13Co) and turbine blade shroud interlocks 
(Cr 3 C 2 /80Ni - 20Cr) . 

In the plasma- spray technique, powder is heated 
by an argon-fed d.c. arc (Figure 12.15) and then pro- 
jected on to the workpiece at velocities of 125-600 m 
s" 1 . A shielding envelope of inert gas (Ar) is used 
to prevent oxidation of the depositing material. The 
process is used to apply MCrAlY-type coatings to tur- 
bine components requiring corrosion resistance at high 
temperatures (e.g. blades, vanes), where M signifies 
the high-m.p. metals Fe, Ni and/or Co. These coat- 
ings can accommodate much more of the scale-forming 
elements chromium and aluminium than superalloys 
(e.g. 39Co-32Ni-21Cr-7.5Al-0.5Y). They provide a 
reservoir of oxidizable elements and allow the ‘protec- 
tive’ scale layer to regenerate itself. The small amount 
of yttrium improves scale adhesion. This particular 
composition of coating is used for hot gas path seals 
in locations where a small clearance between the rotat- 
ing blades and the interior walls of the engine gives 
greater fuel efficiency. These coatings will withstand 
occasional rubbing contact. 

12.3.4 Surface modification with high-energy 
beams 

12.3.4.1 Ion implantation 

The chemical composition and physical structure at 
the surface of a material can be changed by bombard- 
ing it, in vacuo, with a high-velocity stream of ions. 
The beam energy is typically about 100 keV; efforts 
are being made to increase the beam current above 
5 mA so that process times can be shortened. Cur- 
rently, implantation requires several hours. The ions 
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Figure 12.15 Coating by plasma-spray torch (from 
Weatherill and Gill, 1988; by permission of the Institute of 
Materials). 


may be derived from any element in the Periodic 
Table: they may be light (most frequently nitrogen) 
or heavy, even radioactive. Ion implantation 1 is a line- 
of-sight process; typically, a bombardment dose for 
each square centimetre of target surface is in the order 
of 10 17 — 10 19 ions. These ions penetrate to a depth 
of 100-200 nm and their concentration profile in a 
plane normal to the surface is Gaussian. Beyond this 
modified region, the properties of the substrate are 
unaffected. 

The beam usually has a sputtering effect which 
ejects atoms from the surface and skews the concen- 
tration profile. This effect is most marked when heavy 
ions or heavy doses are used. It is possible for a steady 
state to be achieved, with the rate of sputter erosion 
equal to the rate of implantation. Thus, depending 
upon the target, the type and energy of ion and the 
substrate material, sputter erosion is capable of limit- 
ing the amount of implantation possible. As a general 
guide, the maximum concentration of implanted ion is 
given, roughly, by the reciprocal of the sputtering yield 
(7). As one would expect, Y increases in value with 
increases in ion energy. However, Y values for pure 
metals are broadly similar, being about 1 or 2 for typi- 
cal argon ion energies and not differing from each other 
by more than an order of magnitude. Thus, because of 
sputter, the maximum concentration of implanted ions 
possible is in the order of 40-50 at.%. In cases where 
it is difficult to attain this concentration, a thin layer of 
the material to be implanted is first deposited and then 
driven into the substrate by bombardment with inert 
gas ions (argon, krypton, xenon). This indirect method 
is called ‘ion beam mixing’. 

During ion bombardment each atom in the near- 
surface region is displaced many times. Various forms 
of structural damage are produced by the cascades of 
collisions (e.g. displacement spikes, vacancy/interstitial 
(Frenkel) pairs, dislocation tangles and loops, etc.). 
Damage cascades are most concentrated when heavy 
ions bombard target atoms of high atomic number (Z). 

1 Pioneered by the UKAEA, Harwell, in the 1960s. 


The injection of atoms and the formation of vacancies 
tend to increase the volume of the target material so 
that the restraint imposed by the substrate produces a 
state of residual compressive stress. Fatigue resistance 
is therefore likely to be enhanced. 

As indicated previously, the ions penetrate to a 
depth of about 300-500 atoms. Penetration is greater 
in crystalline materials than in glasses, particularly 
when the ions ‘channel’ between low-index planes. 
The collision ‘cross-section’ of target atoms for light 
ions is relatively small and ions penetrate deeply. 
Ion implantation can be closely controlled, the main 
process variables being beam energy, ion species, ion 
dose, temperature and substrate material. 

Ion implantation is used in the doping of semicon- 
ductors, as discussed in Chapter 6, and to improve 
engineering properties such as resistance to wear, 
fatigue and corrosion. The process temperature is 
less than 150°C; accordingly, heat-treated alloy steels 
can be implanted without risk of tempering effects. 
Nitrogen-implantation is applied to steel and tungsten 
carbide tools, and, in the plastics industry, has greatly 
improved the wear resistance of feed screws, extrusion 
dies, nozzles, etc. The process has also been used to 
simulate neutron damage effects in low- swelling alloys 
being screened for use in atomic fission and fusion 
reactors. A few hours’ test exposure to an ion beam 
can represent a year in a reactor because the ions have 
a larger ‘cross- section’ of interaction with the atoms in 
the target material than neutrons. However, ions cannot 
simulate neutron behaviour completely; unlike neu- 
trons, ions are electrically charged and travel smaller 
distances (see Chapter 6). 

12.3.4.2 Laser processing 

Like ion implantation, the laser 2 process is under active 
development. A laser beam heats the target material 
locally to a very high temperature; its effects extend 
to a depth of 10-100 pm, which is about a thousand 
times greater than that for an ion beam. Depending on 
its energy, it can heat, melt, vaporize or form a plasma. 
The duration of the energy pulse can be 1 ns or less. 
Subsequent cooling may allow a metallic target zone 
to recrystallize, possibly with a refined substructure, 
or undergo an austenite/martensite transformation (e.g. 
automotive components). There is usually an epitaxial 
relation between the altered near- surface region and 
the substrate. Cooling may even be rapid enough to 
form a glassy structure (laser glazing). Surface alloying 
can be achieved by pre-depositing an alloy on the 
substrate, heating this deposit with a laser beam to 
form a miscible melt and allowing to cool. In this way, 
an integral layer of austenitic corrosion-resistant steel 
can be built on a ferritic steel substrate. In addition to 

2 Light Amplification by Stimulated Emission of Radiation 
(LASER) devices provide photons of electromagnetic 
radiation that are in-phase (coherent) and monochromatic 
(see Chapter 6). 
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its use in alloying and heat-treatment, laser processing 
is used to enhance etching and electroplating, (e.g. 
semiconductors). 

The principal variables in laser processing are the 
energy input and the pulse duration. For established 
techniques like cutting, drilling and welding metals, 
the rate of energy transfer per unit area (‘power den- 
sity’) is in the order of 1 MW cm -2 and pulses 
are of relatively long duration (say, 1 ms). For more 
specialized functions, such as metal hardening by 
shock wave generation, the corresponding values are 
approximately 100 MW cm -2 and 1 ns. Short pulses 
can produce rapid quenching effects and metastable 
phases. 
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Chapter 13 
Biomaterials 


13.1 Introduction 

Biomaterials are materials used in medicine and den- 
tistry that are intended to come in contact with liv- 
ing tissue. The familiar tooth filling is where most 
humans first encounter biomaterials but increasingly 
many people now rely on more critical implants such 
as joint replacements, particularly hips, and cardiovas- 
cular repairs. Undoubtedly, these biomaterial implants 
improve the quality of life for an increasing number 
of people each year, not just for an ageing population 
with greater life expectancy, but for younger people 
with heart problems, injuries or inherited diseases. 

Biomaterials have now been successfully developed 
and used for more than a generation. First-generation 
biomaterials largely depended on being inert, or rel- 
atively inert, with minimal tissue response. For these 
materials a minimal fibrous layer forms between the 
biomaterials and the body when the material is not 
totally accepted by the body. The success of this type 
of implant depends largely on the selection of materi- 
als for their manufacture. Thus the now standard hip 
replacement (almost a million worldwide each year) 
initially used a multi-component assembly made with 
austenitic stainless steel for the stem, PMMA for fix- 
ation and polyethylene for the acetabular cup (see 
Figure 13.1). All the materials proved relatively bio- 
inert and gave an average life time of 10 years or 
more. 

Nowadays, while continuing with improved bio- 
inert materials, development has focused on bioactive 
materials which influence the biological response in 
a positive way, e.g. encourage bonding to surround- 
ing tissue with stimulation of new bone growth. With 
this bio-active approach the interface between the body 
cells and the implant is critical and the materials sci- 
ence of the biomaterials surface extremely important. 

In this chapter, various applications of biomaterials 
will be examined, from dental materials to drug deliv- 
ery systems. All types of materials are used in these 
applications and the criteria governing their selection 



will be considered together with future development 
in the biomaterials field. 

13.2 Requirements for biomaterials 

The requirements for a biomaterial are extremely 
demanding. Replacement or repair of a body feature, 
tissue, organ or function often necessitates the mate- 
rial used to have specialised mechanical, physical and 
chemical properties. However, the very first require- 
ment is biocompatibility with the human body, i.e. the 
ability of the material to perform with an appropriate 
host response. Unfortunately, no material is universally 
biocompatible, since a material may be biocompatible 
in one application but not with another. Biocompati- 
bility is therefore application specific. 

For the successful use of the biomaterial, consider- 
ation has to be given to the appropriate material selec- 
tion, engineering design and manufacturing process. 
While proper design and manufacture is essential, it is 
particularly important to select the correct material to 
provide the appropriate properties as well as being bio- 
compatible, recognizing that the combined influence of 
mechanical and chemical factors can be quite serious, 
e.g. causing fatigue, corrosion fatigue, stress corrosion, 
wear, fracture. It is also important to recognize that the 
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biological environment is not constant and that oxygen 
levels, availability of free radicals and cellular activity 
will vary. Corrosion and degradation can lead to loss 
of integrity of the implant and, of course, release ions 
into the body, often setting up an allergic reaction. 

Biomaterial applications make use of all classes of 
material, metals, ceramics, polymers and composites, 
divided roughly into three usertypes. These are 

(i) inert or relatively inert with minimal host response, 

(ii) bioactive which actually stimulates bonding to 
the surrounding tissue and (iii) biodegradable which 
resorb in the body over a period of time. Metals 
are generally chosen for their inert qualities whereas 
ceramics and polymers may offer bioactivity or 
resorption. 

The most common metallic materials used are 
austenitic stainless steels, cobalt- chromium alloys or 
titanium; typical compositions are shown in Table 13.1. 
Recently, titanium alloys, particularly Ti-6A1-4V, 
have been introduced because of their corrosion resis- 
tance, strength and elastic modulus (see Table 13.2) 
but poor tribology can still be a problem. It is also 
favoured for its superior biocompatibility and, unlike 
Co-Cr or stainless steel, does not cause hypersen- 
sitivity. Of the ceramics, aluminium oxide, calcium 
phosphate, apatite, carbon/graphite and bioglass are in 
use mainly for their inertness, good wear character- 
istics, high compressive strength and in some cases 
bioactivity. Their poor tensile properties and fracture 
toughness are design limitations. Polymers are widely 
used, both alone and in combination with ceram- 
ics or metals. These include; polymethyl methacry- 
late (PMMA) for cement and lenses; polyethylene for 
orthopaedics; polyurethane as blood contact material, 
e.g. vascular tubing, cardiovascular devices, catheters; 
polysiloxanes in plastic surgery, maxillofacial and 


cardiovascular surgery; polyesters and polyamides in 
wound closure management. Composites such as ultra- 
high-molecular-weight polyethylene reinforced with 
either carbon fibres or the ceramic hydroxyapatite are 
increasingly being considered for applications involv- 
ing high contact stress and wear resistance. 

13.3 Dental materials 

13.3.1 Cavity fillers 

Dentistry has always been very dependent on the 
use of biomaterials and particularly receptive to the 
application of new developments in metals, ceramics, 
polymers and composites. 

Dental amalgams have been used in cavity fillings 
for more than 100 years, initially with silver- mercury 
amalgams, later modified by tin additions to control 
the amount of expansion. These amalgams produced 
the weak, corrodible intermetallic y 2 phase, Sn 7 Hg, 
and hence the modem dental amalgam now also con- 
tains copper (>12%) in order to suppress this phase. 
The amalgam is made by mixing silver, tin, cop- 
per alloy powder with mercury and this mixture is 
packed into the cavity where it hardens to produce a 
strong, corrosion-resistant, biocompatible filling. There 
is some evidence that even this filling may be suscep- 
tible to corrosion as a result of the Cu 6 Sn 5 (j] r ) phase 
and the addition of Pd has been advocated. Attempts 
to replace the Hg amalgam by gallium, indium, sil- 
ver, tin, copper pastes have not yet been completely 
successful. 

Alternative resin-based composite filling materials 
have been continuously developed since first intro- 
duced in the 1960s. These composite fillings have a 


Table 13.1 Composition of orthopaedic implant alloys (wt%); from Bonfield, 1997 


Element 


Cobalt-base alloys 

Stainless steel 

Titanium alloys 

ASTM F75 

cast 

ASTM F90 
wrought 

ASTM F563 
isostatically 
pressed 

ASTM FI 38/A 

ASTM F138/9B 

Commercial 

purity 

titanium 

Ti-6Al-4V 

Co 

Balance 

Balance 

Balance 









Cr 

27-30 

19-12 

18-22 

17-20 

17-20 

— 

— 

Fe 

0.75 max 

3.0 max 

4-6 

Balance 

Balance 

0.3-0.5 

0.25 max 

Mo 

5-7 

— 

3-4 

2-4 

2-4 

— 

— 

Ni 

2.5 max 

9-11 

15-25 

10-14 

10-14 

— 

— 

Ti 

— 

— 

0.5-3. 5 

— 

— 

Balance 

Balance 

A1 

— 

— 

— 

— 

— 

— 

5. 5-6. 5 

V 

— 

— 

— 

— 

— 

— 

3. 5-4. 5 

C 

0.35 max 

0.05-0.15 

0.05 max 

0.03 max 

0.08 max 

0.01 max 

0.08 max 

Mn 

1.0 max 

2.0 max 

1.0 max 

2.0 max 

2.0 max 

— 

— 

P 

— 

— 

— 

0.03 max 

0.025 max 

— 

— 

S 

— 

— 

0.01 max 

0.03 max 

0.01 max 

— 

— 

Si 

1.0 max 

1.0 max 

0.5 max 

0.75 max 

0.75 max 

— 

— 

O 

— 

— 

— 

— 

— 

0.18-0.40 

0.13 max 

H 

— 

— 

— 

— 

— 

0.01-0.015 

0.012 max 

N 

— 

— 

— 

— 

— 

0.03-0.05 

0.05 max 
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Table 13.2 The mechanical properties of some natural and biomaterials 


Material 

Elastic 
modulus 
(GN m~ 2 ) 

Tensile 
strength 
(MN m~ 2 ) 

Elongation 

(%) 

Fracture 
toughness 
(MN m ~ 3/ 2 ) 

Fatigue 
strength 
(MN m~ 2 

Austenitic stainless steel 

200 

200-1100 

40 

100 

200-250 

Cobalt- Chromium 

230 

450-1000 

10-30 

100 

600 

Ti-6A1-4V 

105-110 

750-1050 

12 

80 

350-650 

Alumina 

365 

— 

< 1 

— 

400 

Hydroxyapatite 

85 

40-100 

— 

— 


Glass fibre 

70 

2000 

2 

1-4 


PMMA 

2.8 

55 

8 

— 

20-30 

Bone cement 

3-2.3 

1.5 

1-2 

400 


Polyethylene 

1 

20-30 


1-4 

16 

Nylon 66 

4.4 

700 

25 



Silicone rubber 

6 x 10^ 3 

1.4 




Polycarbonate 

2 

60 




Bone (cortical) 

7-25 

50-150 

— 

2-12 


Bone (cancellous) 

0 . 1 - 1.0 

50-150 


2-12 


Tooth enamel 

13 

240 

— 

— 


Tooth dentine 

— 

135 


— 


Collagen, tendon, 

2 

100 

10 

— 



wet 


strength similar to amalgams but poorer wear proper- 
ties. The paste is created by mixing a dimethacrylate 
monomer with resin and adding a filler of micron- sized 
silane-coated ceramic particles. The paste is activated 
by strong light when the resin polymerizes. Bonding 
of the composite resin to the tooth structure employs 
a phosphoric acid etch of the tooth enamel. This pro- 
duces mini-chasms into which the resin material flows 
and locks to form a strong mechanical bond. This tech- 
nique is not successful, however, for bonding to the 
dentine in the tooth cavity (Figure 13.2) and so, in 
the absence of enamel, dentine bonding agents have 
to be used. These are primers containing bifunctional 
compounds with (i) hydrophilic molecules which form 
links with the wet dentine in the tooth cavity and (ii) 
hydrophobic molecules which form links with the resin 
in the composite. 

Cavities in front teeth are usually filled with glass 
cements to match the colour and translucency of the 
enamel. Silicate cements are formed when phosphoric 
acid displaces metal ions from an alumina- silica glass, 
containing metal oxides and fluorides. The cement sets 
when aluminium phosphate is precipitated between 
the glass particles. Developments based on this basic 



chemistry employ polymeric acids with carboxylate 
groups. In this case, the metal ions displaced from the 
glass cross-link with the polymeric acid chain, caus- 
ing the cement to set. In addition, the acids undergo an 
ion-exchange reaction with the calcium phosphate in 
the apatite of the dental material. These glass ionomer 
cements therefore form direct chemical bonds to the 
tooth material. Resin-modified versions are also avail- 
able which have improved durability; these contain 
carboxylate groups to give a good bond to the tooth 
and dimethacrylate, as in the composite resin. 

13.3.2 Bridges, crowns and dentures 

Missing teeth may be replaced by artificial teeth in 
a number of different ways. For a group of missing 
teeth, removal partial dentures (RPDs) may be the 
answer which consist of a cast metal framework of 
Co-Cr or Ni-Cr alloy carrying the artificial teeth 
and having end clasps to retain it to good natural 
teeth nearby. Fixed partial dentures (FPDs or bridges) 
may be used for a few missing teeth. Sometimes the 
supporting teeth are cut down to accommodate a close- 
fitting artificial tooth casting which is cemented into 
place. In other cases, the alloy framework carrying 
the artificial teeth is bonded to acid-etched teeth to 
avoid cutting down good teeth. The teeth are acid 
etched and the metal framework electrolytically etched 
to produce structural grooves and chasms which allow 
strong mechanical bonds to be formed with resin-based 
composite cements. In some situations, etching can be 
avoided when the oxides on the metal framework can 
be treated with bifunctional primers to form chemical 
links to the cement. 

Over the last 20 years or so the quality of bonded 
restorations, i.e. porcelain-covered metal castings, has 
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been refined to combine the impact strength of a metal 
substructure with the appearance of dental porcelain. 
These porcelains have good mechanical properties and 
their thermal expansion characteristics are matched to 
the metal in order to avoid interfacial stresses and 
cracking. 

Dental porcelains are basically vitrified feldspar 
with metallic oxide pigments to simulate natural tooth 
enamel. They are usually supplied to the dental labora- 
tories in the reacted and ground form for final fabrica- 
tion by the technician; this involves mixing the powder 
with distilled water to form a paste which is used to 
make the crown, then drying and firing in order to sin- 
ter and densify the crown material. Generally, firing is 
carried out in stages starting with the innermost struc- 
ture of the crown, followed by the body and finally the 
outer glaze and surface staining. Developments have 
included the strengthening of the inner core material 
with alumina to prevent cracking; also the addition of 
magnesia to form a magnesia- alumina spinel which 
has a low shrinkage on firing. Glass-ceramics have also 
been used either to fabricate the crown by casting fol- 
lowed by heat treatment to produce crystallization, or 
by machining from a pre-fired block of glass-ceramic 
under CAM/CAD conditions. 

For complete replacement dentures, the basic 
material, which has existed for many decades, is 
methyl methacrylate. Substitute materials have been 
limited and most improvements and developments 
have occurred in the processing technology. While 
the mechanical properties of denture base resins 
are not particularly good (modulus of elasticity 3 x 
10 9 N m -2 , tensile strength ~100 kN m -2 , elongation 
~3%) they do have suitable surface and abrasion 
properties and are chemically inert, non-toxic 
and cheap. Improvements have been forthcoming 
in elastomers used for taking impressions; these 
now include vinyl addition silicone and polyether 
elastomers. 

13.3.3 Dental implants 

Dental implants have been far less developed than 
those associated with body implants (see hip joints, 
etc.). Probably the simplest forms are posts, Co-Cr, 
stainless steel, titanium alloy or gold alloys cemented, 
or even screwed, into the tooth canal after the tooth 
has been root treated to remove the nerve. Dental 
porcelain caps may then be cemented onto the root 
post. Ti implants have been screwed into the bone 
beneath extracted teeth. After some time the pas- 
sive surface layer of the titanium implants becomes 
osseo-integrated with the bone and can be used as 
a strong base onto which a titanium mini structure 
can be fitted, complete with tooth assembly. Osseo- 
integration is improved by using a coating on the 
titanium implant such as hydroxyapatite or bioglass 
(Figure 13.3). Ceramic and carbon implants set into 
the bone have been used with sapphire single crystals 
and pyrolitic graphite as favoured materials. 



Figure 13.3 Hydroxyapatite-coated titanium root implant 
courtesy P. Marquis, Dental School, Birmingham. 


13.4 The structure of bone and bone 
fractures 

In repairing bone fractures it is necessary to hold the 
bone together while the natural healing process takes 
place, usually for a few months. Metallic alloys have 
been universally used for plates, pins, screws, etc. They 
are considered to have the required strength and rigid- 
ity together with sufficient corrosion resistance. Gen- 
erally, these fixtures are considered temporary until 
the bone has joined after which they are removed. The 
main reason for this is that the very rigidity with which 
the bone is held to allow healing will eventually lead 
to progressive weakening of the bone structure. 

The biological structure of bone is reproduced in 
Figure 13.4. In material science terms, the apparent 
complexity of bone can be described as a composite 
made up of a matrix of collagen (polymer) reinforced 
with approximately 50% volume fraction of hydroxy- 
apatite (ceramic) nanometre- scale crystals. Most bones 
are made of a porous cellular structure (cancellous 
bone) covered with a denser compact shell. The poros- 
ity and density of the cancellous bone varies with 
location in the body depending on the stress state in 
that region. Low density regions have a relatively open 
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Figure 13.4 The hierarchy of structure in bone. At the molecular level a, polarized triple helix of tropo collagen molecules 
assembles into a microfibril b, with small gaps between the ends of the molecules into which small (5 by 30 nm) crystals of 
hydroxyapatite later form. These microfibrils assemble into larger fibrils c, which then form the layers in the osteon (d — part 
cut away to show the alternating orientation of fibres in the annular layers). Osteons form in association with each other e, 
forming bone f. The cells which are responsible for most of this process, the osteocytes, are shown sitting between the layers of 
the osteon d. (after Vincent, 1990; courtesy Institute of Materials). 


cell structure and high density regions more closed. 
The density determines the strength and stiffness of 
the bone which grows and develops to support the 
stress imposed on it. This may be uniaxial, when 
the cell walls will be oriented and thicker along this 
direction, or more uniform when the cells are roughly 
equiaxed. 

Bone is not a static entity but dynamic in nature, 
continually undergoing remodelling where ‘old’ bone 
is resorbed and replaced by new bone. Various factors 
control the process but extremely important in stim- 
ulating the bone-producing cells (osteoblasts) is the 
application of stress. In bone, the mineral phosphate 
material is slowly dissolved and re sorbed by the body 
and when subjected to normal stress is replenished by 
new bone material synthesized by the osteoblasts. This 
recycling process ensures a healthy, strong bone struc- 
ture with ageing. Bones for which the major stress is 
carried by metal implants do not show the same ten- 
dency for replenishment so that the bone surrounding 
the implant is resorbed without replenishment lead- 
ing to a loosening of the implant. A rigid metal plate 
attached to a bone that has healed will nevertheless 
carry the majority of the load. To avoid this ‘stress 
shielding’ leading to bone weakness it is recommended 
that the surgeon removes the holding plates soon after 
the fracture has healed. 

With the aim of trying to avoid the removal oper- 
ation, alternative approaches are being considered 
for bone plates, screws, etc., including the use of 
biodegradable, or resorbable, materials such as poly- 
mers or composites which could be resorbed into the 


surrounding tissue or dissolve completely over a period 
of time after the fracture has healed. However, the 
combination of strength, ductility, toughness, rigidity 
and corrosion resistance of metals is hard to match 
with the non-metallics. 

The large difference in elastic modulus between 
competing biomaterials and bone (see Table 13.2) is 
evident. Of the various metals, titanium and its alloys 
are clearly the most suitable and are being increasingly 
used. Titanium does, however, have a tribological 
weakness but the application of coatings and surface 
engineering is being increasingly adopted to overcome 
this problem. 

13.5 Replacement joints 

13.5.1 Hip joints 

The basic design of an artificial hip joint includes 
an alloy femoral stem, with a metallic or ceramic 
femoral head moving in an ultra-high-molecular- 
weight polyethylene (UHMWPE) acetabular cup, as 
shown in Figures 13.1 and 13.5. The average life of 
the joint is better than 10 years but implants tend to 
loosen as a result of bone resorption due to modulus 
mismatch. Friction and wear also cause wear debris 
between the cancellous, i.e. more porous, bone and the 
cup and also between the femoral head and the softer 
UHMWPE. Overall the failure rate is now about 1-2% 
per year, as shown in Figure 13.6. 
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Figure 13.5 (a) Conventional hip joint made of stainless 
steel and UHMWPE acetabular cup ( courtesy R. Grimer, 
Royal Orthopaedic Hospital, Birmingham ). (b) Modular 
artifical hip made of titanium stem, alumina head and 
UHMWPE acetabular cup ( courtesy P. Marquis, Dental 
School, Birmingham ). 

Friction and wear problems are being improved 
by using ultra-hard materials as the bearing mate- 
rial. A good resistance to frictional wear together 
with biocompatibility makes alumina well suited for 
the femoral head. Unfortunately, A1 2 0 3 has a low 
impact strength and is liable to failure when subjected 
to stresses introduced by extra activity, e.g. jumping. 
Research has also shown that ion implantation of the 
UHMWPE in a nitrogen atmosphere at 10 5 mbar at 
80 keV to a dose of 1 x 10 17 ions cm -2 reduces the 
wear behaviour to virtually zero. The enchanced sur- 
face properties together with higher hardness and elas- 
tic moduli would very much improve the interaction 
problems. A compromise solution is to have a titanium 
stem with stainless steel head coated with a thin layer 
of ceramic. 

The problem of loosening of the joint due to bone 
resorption is being tackled by the second generation 
of biomaterials and implants which mimic the body’s 
tissues. One such material developed by Professor 
Bonfield and his colleagues at Queen Mary and West- 
field College is a composite of hydroxyapatite (HA) 


in high-density polyethylene. The HA provides the 
strengthening and stiffness reinforcement to the tough 
polyethylene matrix. HAPEX , as it is called, is biocom- 
patible and bioactive, encouraging bone growth onto 
the implant material which, of course, contains HA. 
The biological response has been demonstrated by cell 
culture studies with human oesteoblasts in which the 
cells grew and spread over the composite initiated at 
the HA particles. In vivo testing has shown a stable 
interface between the implant and bone. 

13.5.2 Shoulder joints 

These joints, like hip joints, are increasingly being 
replaced by metal prostheses (Figure 13.7). Initially, 
similar principles were applied to shoulder joints as 
to hip joints, using a stainless steel ball and socket 
with polyethylene cup. However, because it is not a 
load-bearing joint the trend in shoulder replacement is 
simply to resurface the worn part on the humeris side 
with a stainless steel cap rather than replace both sides 
of the joint. 

13.5.3 Knee joints 

Total knee replacements (TKR) are increasingly being 
made to give pain-free improved leg function to many 
thousands of patients suffering from osteoarthritis, or 
rheumatoid arthritis. As with hip replacements, TKR 
involves removing the articulating surfaces of the 
affected knee joint and replacing them with artificial 
components made from biomaterials. Such operations 
have a successful history and a failure rate of less than 
2% per year (Figure 13.6). A typical replacement joint 
consists of a tibial base plate or tray, usually made of 
stainless steel, Co-Cr, or titanium alloy, with a tibial 
insert (UHMWPE) that acts as the bearing surface (see 
Figure 13.8). The femoral component largely takes the 
shape of a natural femoral condyle made from the 
above alloys, and articulates with the bearing surface, 
together with the knee cap or patella. The patella may 
be all polyethylene or metal backed. The components 
are either fixed by cement or uncemented. Cemented 
components use acrylic cement (polymethyl methacry- 
late). Uncemented components rely on bone ingrowth 
into the implant. Titanium used on its own shows evi- 
dence of bone ingrowth, but more recently this has 
been improved by use of hydroxyapatite (HA) coat- 
ing. Problems include wear debris from the PTFE and 
possible fracture of the component. 

One of the continuing challenges is improved design 
against failure, currently about 2%, as a result of over- 
loading, or increased physical activity, and possibly 
bone resorption. 

13.5.4 Finger joints and hand surgery 

Replacement finger and hand joints are far more 
complex than other joints because of the degree of 
flexibility required through large angles while main- 
taining overall stability. Early designs were based on 
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Figure 13.6 The failure rate for a total hip replacement (THR) and total knee replacement (TKR) (courtesy R. Grimer, Royal 
Orthopaedic Hospital, Birmingham). 


the movement of a metal ‘hinge’ but had a problem 
of fatigue and/or corrosion. Nowadays silicone rub- 
ber is more commonly used, showing little tendency 
to fatigue failure while being biocompatible with an 
ability to absorb lipids and fatty acids. 

Polysiloxane is used in hand surgery to form a 
‘tunnel’ to allow transplanted tendons to slide back 
and forward. It is also used to replace carpal bones. 
It is, however, not suitable for longer joints or bones 
unless reinforced. 

13.6 Reconstructive surgery 

13.6.1 Plastic surgery 

The many different biomaterials used in plastic surgery 
include collagen, silicone (polydimethylsiloxane), 
Teflon , polyethylene, Dacron and polyglocolic acid. 
Polysiloxanes are widely used in reconstructive plastic 
surgery because of their lack of any tissue reactivity, 
mechanical properties and structure. Polysiloxane may 
be either heat vulcanized or vulcanized at room 
temperature for more delicate structures and is ideal for 
soft tissue replacement where repeated flexure occurs. 
In applications where strength is also required, steel 
wire reinforcement is necessary. The material is used 
not only for implants but also for explants where it 
is moulded to fit a specific shape, coloured to give a 
skin match and fixed with tissue adhesive. Common 
examples are ear prosthesis or facial reconstruction 
after loss due to injury or cancer surgery. Silicone 
mammary implants are now widely used for cosmetic 



Figure 13.7 Shoulder joint prosthesis ( courtesy P. Marquis, 
Dental School, Birmingham). 
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Figure 13.8 A total knee replacement joint (a) schematic diagram ( after Walker and Sathasiwan, 1999 and (b) photograph of 
a stainless steel prosthesis (courtesy R. Grimer, Royal Orthopaedic Hospital, Birmingham ). 


reasons or after mastectomy for breast cancer. These 
have an outer shell with the appropriate shape and 
resilience covering an inner volume of silicone gel. 
In these reconstructions Dacron may be used as a 
lining for the ear and as a backing in a breast implant 
to provide a better fastening of the implant to the 
surrounding tissue. 

13.6.2 Maxillofacial surgery 

Disease or injury to the oral and facial area may be 
repaired by implants from a range of biomaterials. 


Often the effective prostheses can improve the confi- 
dence and well-being of the patient beyond the imme- 
diate functional repair. The jaw, and jaw bone area, can 
utilize many of the metal implants already discussed. 
Commercially pure titanium in perforated sheet form 
has been used because of its biocompatibility and ease 
of manipulation and fixing. 

Polymers, particularly silicones and polyurethanes, 
may be used to replace flexible tissues of the nose, 
cheek and ear regions of the face. Polysiloxanes have 
been used for onlays in the area of the molar bone in 
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the lateral side of the mandible or over the forehead to 
smooth it. Reinforced with stainless steel wires they 
can also replace the mandible. A porous composite 
of polytetrafluoroethylene (PTFE) strengthened with 
carbon fibres may be used to replace damaged bone 
structures. Both PTFE and carbon are biocompatible 
and fibrous tissue growth into the pores ensures bond- 
ing of the artificial and natural bone. Composites of 
hydroxyapatite in a polyethylene matrix HAPEX have 
been used for patients who had either fractured the 
orbital floor supporting the eye or had lost an eye. A 
great advantage of such material implants is that they 
can be shaped during the operation and inserted on 
the base of the eye socket and bonded firmly to the 
supporting bone. 

13.6.3 Ear implants 

HAPEX has been also used successfully for other clin- 
ical replacements, mainly middle-ear implants. These 
transmit sound from the outer to inner ear where the 
vibrations are translated into electrical signals to be 
processed by the brain. Middle-ear malfunction can 
lead to deafness (conduction deafness) which may 
be cured by implant surgery. Otosclerosis (middle-ear 
deafness) may result from fibrosis of the middle ear 
caused by repeated infections or from an hereditary 
disease. A stapedectomy removes a small amount of 
bone and immobilized tissue and replaces it with bio- 
material. The implant has a hydroxyapatite head on a 
HAPEX shaft which can be trimmed and shaped in 
the operating theatre using an ordinary scalpel to fit 
the individual patient. 

13.7 Biomaterials for heart repair 

13.7.1 Heart valves 

Heart disease is one of the major killers in the devel- 
oped world. Many of the serious conditions arise from 
the strain imposed on the heart by obstructions to the 
flow of blood either in the main passages of the cir- 
culatory system or as a result of valvular disease. The 
heart acts as a blood pump with four valves which open 
in response to a unidirectional flow of blood. Two of 
the valves allow the blood into the heart and the other 
two control the blood leaving the heart. Problems can 
arise with the heart valves if they become structurally 
damaged by disease affecting the opening and closing 
mechanisms. 

The early prostheses developed in the 1960s for 
valve replacement were based on a stainless steel ball- 
in-cage or polysiloxane ball in a Co-Cr alloy cage 
(see Figure 13.9). These restrict the blood flow, even 
in the open position, and were superseded by a tilt- 
ing disc device which opens and closes to the beat 
of the heart to allow the required blood flow. The 
major problem with valves arose from the tendency 
to initiate a blood clot and thus it was found necessary 


to give valve recipients anticoagulant drugs. Nowa- 
days, advanced valves are made with Co-Cr or tita- 
nium bodies with metal or graphite discs, or occluders, 
coated with pyrolytic carbon. These coatings are made 
by heating a hydrocarbon, such as methane, to about 
1500°C depositing the carbon vapour on the graphite 
surface to a thickness of about 1 mm. Small amounts 
of silane mixed with the CH 4 adds Si to the deposit, 
increasing its strength. Pyrolytic carbon is strong and 
wear resistant but, more importantly, resists the forma- 
tion of blood clots on its surface. The discs are attached 
by a coated Ti metal arm to a fabric ring made of poly- 
mer (PTFE) which is sewn to the tissue of the heart 
valve opening. Dacron cloth has also been used and 
encourages tissue growth with better anchorage and 
thromboembolic resistance. 

The prosthesis has moving parts and thus catas- 
trophic failure by fracture is a finite possibility. As an 
alternative, the construction of artificial valves from 
biological tissue has been developed. Collageneous 
tissue from the heart wall of cows and heart valves 
from pigs have been used to make these ‘bioprosthetic’ 
valves. These valves are naturally biocompatible with 



(b) 


Figure 13.9 Photographs of replacement heart valves: 
(a) ball in a cage type (courtesy Institute of Materials, 
London), and (b) pyrolitic carbon disc type (courtesy P. 
Marquis, Dental School, Birmingham ). 
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a reduced risk of thrombosis. Unfortunately two prob- 
lems have hindered this development. The first is that 
the collagen valve material suffers from slow calcifi- 
cation whereby hard deposits form on the surface of 
the valves causing them to stick and tear. The use of 
anticalcification drugs are a possibility but the second 
problem has thrown the whole area of implant surgery 
using animal tissue in doubt. This problem is the emer- 
gence of BSE (mad cow disease) in cattle, which has 
led to restrictions and worries in the use of animal tis- 
sue for reconstructive implant surgery because of the 
fear of transmission of viral illness from animal tissue 
to humans. 

13.7.2 Pacemakers 

Other developments in improving heart performance 
include the use of cardiac pacemakers which produces 
a 5V electrical impulse for 1/500 second at regular 
heartbeat rate. These devices have been available for 
some time but have improved significantly over recent 
years. The basic requirement is to provide electrical 
signals to the heart at the appropriate level to stim- 
ulate the patient’s own electrical activity to produce 
the proper physiological change, normally linking the 
pacemaker into the cardiac system so that it works 
when needed. The biomaterial aspects of the pace- 
maker are, however, also important, not only in over- 
coming the problems introduced by any device/body 
environmental interaction, but also in designing the 
proper electrical supply and insulation. Power sup- 
plies have advanced considerably in the last few years 
and lithium cells are now exclusively used. Titanium 
is again the most common biomaterial to encase the 
device, manufactured and electron beam welded to seal 
it hermetically. Polymers have been used for encapsu- 
lation, e.g. epoxy resin or silicone rubber but these 
materials do not completely prevent moisture from 
entering the pacemaker and shortening the lifetime 
of the device. These have now been superseded by 
titanium alloys because of their better strength and 
environmental properties. These are sutured into the 
aorta with a Dacron sleeve. 

Another problem area is that provided by the elec- 
trodes which have to flex with every heartbeat and 
hence are liable to fatigue failure (see Chapter 7). 
Good design and choice of electrode materials can 
minimize this problem. The electrical supply passes 
through the titanium casing via a ceramic insulator 
and the leads to the heart are insulated with a poly- 
mer (polyetherurethane). Degradation with time is still 
a possibility and has to be considered in an effective 
design. 

13.7.3 Artificial arteries 

In branches of surgery, particularly cardiovascular 
surgery, there is often a need to replace arteries 
blocked by atherosclerosis. Sometimes this can be 
achieved by using tissue grafts from the patients, 
thereby avoiding any immune response. In other cases 


it is necessary to use artificial arteries made from poly- 
mers; such arteries must be tough and flexible enough 
to avoid kinking, with the added requirement of avoid- 
ing the formation of blood clots. In modern surgery, 
the blood clotting tendency can be removed by anti- 
coagulants, such as heparin, but the ultimate goal is to 
provide artificial arteries with natural clot resistance. 
Several different polymers have been used to make 
blood vessels but none is entirely satisfactory. 

The polyester Dacron , can make small tubes but 
these have porous walls which have to be sealed. This 
is achieved by treating with the protein albumin and 
heating it to form a coagulated coating. In the body, the 
albumin degrades and is replaced by the natural protein 
collagen forming a smooth lining (pseudointima). This 
process leads to some initial inflammation which is 
one disadvantage of this biomaterial. Woven Dacron 
is quite rigid and unsuitable for small arteries and is 
difficult to suture; it is mostly used for resected aortic 
aneurysms. Knitted Dacron is easier to suture. It may 
be coated with polyurethane, tetrafluoroethylene, or 
heparin to reduce the thrombogenic tendency. The use 
of poly (hydroxy ethyl methacrylate) coating establishes 
an endothelial-like cell layer in a few weeks. 

Other arterial polymers include PTFE and polyure- 
thanes. PTFE is used as an expanded foam to form 
the porous tubes. These rapidly develop a smooth 
neointima layer and thus acquire blood compatibility. 
Polyurethanes have a natural compatibility with 
blood and are tough and flexible in tube form but 
unfortunately slowly degrade in the body producing 
toxic products. PTFE coatings on polyester and 
polyurethane vessels have also been tried. Silicone- 
lined tubing has been used for extra-corporeal 
circulation during open-heart surgery. 

To produce artificial arteries with built-in clot resis- 
tance, heparin molecules have been attached to their 
surface either directly by chemical bonds or by cross- 
linking to form a polymerized heparin film. To mimic 
total thrombosis resistance, however, requires not only 
anticoagulation but also avoidance of platelet deposi- 
tion normally achieved by the endothelial cells lining 
the blood vessel releasing the protein prostacyclin. 
Ideal artificial arteries should have both of these anti- 
clotting agents attached at their surface. 

13.8 Tissue repair and growth 

Tissues include skin, tendons, ligaments and cartilages. 
Skin has the dual property of keeping the body fluids 
in while allowing the outward movement of moisture 
through a porous membrane, which is important in 
cooling and maintaining the body temperature. Skin 
also protects against infections, such as bacteria but is 
not, of course, particularly strong. It is made up of lay- 
ers including an outer epidermis and an inner dermis, 
a dense network of nerve and blood vessels. It is there- 
fore virtually impossible to make an artificial skin from 
biomaterials to match this complexity. Nevertheless 
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skin replacements have been made from polymers with 
an open structure which provides a basic framework 
onto which real skin is able to grow. Moreover, with a 
biodegradable polymer the framework degrades as the 
new tissue regrows. The porous film can be coated 
with silicone rubber to provide infection protection 
and retain fluids while the skin grows. When sutured 
in place, tissue-forming cells (fibroblasts) migrate into 
the porous polymer framework to generate new skin 
layers. For severe burns, artificial skin can be made 
by growing epidermal skin cells within a biodegrad- 
able collagen mesh in a culture medium. The syn- 
thetic skin can then be grafted onto the patient. Other 
biodegradable products include the copolymers lactic 
acid -glycolic acid and lysine -lactic acid. The adhe- 
sion of the polymer framework can be improved by 
incorporating an adhesive protein fibronectin. 

Other tissues such as ligaments and cartilages are 
largely elastic filaments of fibrous proteins. Synthetic 
substitutes have included Dacron polyesters, PTFE 
fibres and pyrolyzed carbon fibres, with mixed success. 
The fibres may be coated with polylactic acid polymer 
which breaks down in the body to be replaced by col- 
lagen. At this stage such techniques are relatively new 
but it does suggest that in future the growth of cells in 
a culture vessel may possibly supply complex bioma- 
terials for various implanted functions. This approach 
is termed tissue engineering and together with biomet- 
rics, i.e. the mimicking of the working of biological 
systems, offers a way of producing materials which 
totally synergize with the human body. 

13.9 Other surgical applications 

Polymers are used in a wide variety of surgical appli- 
cations. Polyurethane has good tissue and blood con- 
tact properties and is used in both short-term applica- 
tions, e.g. catheters, endotracteal tubes, vascular tub- 
ing, haemodialysis parts, and long-term applications, 
e.g. heart-assist devices. Dacron is used in composite 
form with a poly(2-hyroxyethyl methacrylate) matrix 
for orthopaedic tendon reconstruction. Varying the 
composite mix can alter the properties to match the 
requirements. Reinforced, Dacron fabric is used for 
reconstructing the trachea and, in woven Dacron form, 
for small bowel repair or replacement. It is also used 
in the genitourinary systems, in mesh form in repair- 
ing hernias and abdominal wall defects. Polysiloxanes 
are used in neurological surgery, e.g. in valves to drain 
fluids produced intercranially and also as tubes to drain 
other canals such as the middle ear. Not having a 
porous structure, these tubes resist bacterial contam- 
ination. 

Shape -memory -effect (SME) alloys (see Chapter 9, 
section 9.6.4), particularly Ni-Ti, have been used 
in several biomedical applications because of their 
unique behaviour and for their biocompatibility. In 
orthopaedics, for example, pre-stressed fracture bone 
plates can be made to shrink on heating to provide a 


rigid, compressive load fixing. By contrast, Ni-Ti rods 
can be programmed to provide traction on local heating. 
In other applications Ni-Ti has been used in artificial 
heart muscles, teeth- straightening devices, intrauterine 
contraceptive devices and as a filter in the vena cava 
(inserted cold the filter opens its mesh at the temper- 
ature of the deoxygenated blood flowing back to the 
heart). 

In spinal surgery, commercially pure titanium cables 
and screws have been used for the correction of sco- 
liosis by gradual tightening of the cable to straighten 
the spine. A big advantage of Ti for these devices is 
its resistance to crevice corrosion. 


13.10 Ophthalmics 

The main usage of biomaterials in ophthalmics is for 
hard contact lenses, soft contact lenses, disposable 
contact lenses and artificial interocular lenses. Apart 
from being easier to wear soft contact lenses have 
several advantages over hard lenses. Soft lenses ride 
on a thinner tear film fitting closer to the shape of 
the cornea generally giving better corneal health, less 
irritation from dust under the lens and less likelihood 
of being dislodged. 

For satisfactory lens performance the biomaterial 
must be inert, dimensionally stable to retain its optical 
properties, have a low density (~lg cm -3 ), a refractive 
index close to that of the cornea (1.37) and good oxy- 
gen permeability to maintain a healthy cornea. Hard 
contact lenses first became available using PMMA 
because it was light (density 1.19g cm -3 ), easy to 
shape, has a refractive index of 1.49 and is reasonably 
biocompatible. They are, however, difficult to wear for 
long periods because PMMA is (i) hydrophobic and 
not easily wetted by eye fluids and (ii) has a very low 
oxygen permeability. These have now been superseded 
by soft lenses. 

A wide variety of materials have been used for soft 
lenses including silicones, hydroxyethyl methacrylate 
(HEM A) and copolymers with HEM A as the major 
component and vinyl pyrrolidine to increase the water 
content. Most soft lenses are made from hydrogels; 
poly(2-hydroxyethyl methacrylate) is still the most 
popular, containing a small amount of ethylene glycol 
dimethacrylate to act as a cross-linking agent. These 
hydrogel materials have excellent compatibility and 
other properties but even with good oxygen permeabil- 
ity (100 times better than PMMA) additional oxygen 
transplant via tear exchange is necessary. To increase 
the permeability the water content of the lens is raised 
but too high a water content can result in reduced 
strength and poor handleability. 

After cataract surgery and removal, polysiloxane 
coated with wetting agents such as polyvinyl pyrro- 
lidine (PVP) has been used. For intraocular problems, 
silicone injectable has been used. 
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13.11 Drug delivery systems 

Oral administration of tablets is a familiar method 
for taking medicine. Less familiar is regular injection. 
Both methods have many disadvantages not least of 
which is that the drug is being used as a general body 
medicine when, ideally, it is required only at some 
specific site in the body. A second disadvantage is the 
variation in drug level with time, as it is metabolized 
from a high to an insignificant rate rather than a steady, 
more moderate rate. Nowadays, these disadvantages 
are being addressed by developing controlled drug 
delivery; in some cases to specified tissues or organs. 

One form of controlled drug delivery and targeting 
system uses polymeric biomaterial to contain the drug 
which then escapes by diffusion. The diffusion path- 
way through the polymer is provided by the gaps in the 
chain-like structure which may be varied to control the 
release rate. One such release system uses a copoly- 
mer of lactic and glycolic acids to target peptide-based 
drugs to reduce prostate cancer. Another approach uses 
a biodegradable material chemically bonded to the 
drug. In the body the drug is gradually released as 
the material degrades, giving a continued release over 
a known degradation period at a specific site within 
the body. 

A more ambitious system delivers a drug in response 
to blood chemistry levels, such as the release of insulin 
in response to glucose level and has been used with 
some success. The implanted micro-infusion system 


consists of a titanium reservoir of insulin together with 
a micro-pump which delivers the insulin via a fine 
catheter. In principle, a glucose sensor could provide 
a feedback control to the reservoir to complete the 
creation of an artificial pancreas. 
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Chapter 14 
Materials for sports 


14.1 The revolution in sports 
products 

Within a relatively short period of time, say one gen- 
eration, sport has come into remarkable worldwide 
prominence; in the wake of this phenomenon, inno- 
vative and highly specialized industries have emerged. 
Athletic sports, irrespective of the particular form of 
competition, involve the application and transmission 
of force and the expenditure of energy and, as a natu- 
ral consequence, one of the attendant trends is to take 
advantage of the latest developments in materials sci- 
ence and engineering and to use the latest generation 
of new materials for sporting artefacts such as rackets, 
golf clubs, skis, vaulting-poles, etc. Fortuitously, paral- 
lel long-term activities in laboratories of the aerospace 
industries, among others, have helped to provide and 
sustain a basic armoury of versatile new materials: it is 
now generally appreciated by the viewing public that 
these materials have sport-transforming capabilities. 
(Indeed, their introduction has sometimes led to urgent 
reappraisal of the rule book.) With their aid, a human 
can kick, throw or strike a ball farther, cleave air or 
water at greater speed and, in the event of the occa- 
sional mishap, survive with less risk of personal injury. 

In some sports, traditional materials hold sway and 
there are few material changes. In others, new materi- 
als and designs appear with bewildering rapidity, being 
driven by the competitive spirit of the sports equip- 
ment industry and competitors alike. With this dynamic 
background in mind, we have emphasized principles 
wherever possible. Also, rather than attempting to 
encompass sports such as automobile and yacht racing, 
which involve extremely large financial outlays, we 
have focused upon the more individual sports; that is, 
upon sports which engage both professional and ama- 
teur and which are essentially personal and physical. 
This restriction still leaves an extremely broad arena of 
sporting activity, so we have concentrated on certain 
items of sports equipment which provide an insight 


into the processes of engineering design and manufac- 
ture; topics include a revolutionary polymer-moulding 
technique (tennis racket), bending stiffness and flexure 
(golf club), energy storage and transfer (archery bow), 
joining problems (bicycle frames), duplex steel com- 
posites (fencing foils), durability at low temperatures 
(snowboard bindings) and shock absorption (safety 
helmets). The vital interplay between structure and 
properties will be self-evident. Time and time again, 
ultimate commercial success has depended upon a pre- 
cise identification of crucial material properties during 
the design and development stage. 

As a preliminary, we have taken this opportunity to 
describe the structure and properties of wood. Wood 
provides a benchmark for alternative materials in sport 
and many mimic its cellular structure. In the more gen- 
eral context of engineering materials, wood remains 
extremely important in tonnage and particularly volu- 
metric terms. Wood has always been, and remains, the 
most widely used material for construction; arguably, 
it should appear in any preliminary list of candidate 
materials for any engineering design project. 

14.2 The tradition of using wood 

Wood is a biopolymeric composite. Large long-chain 
molecules of cellulose comprise about 50-60% of the 
structure and form hollow, elongated cells (tracheids). 
The principal orientation of tracheids is parallel to the 
axis of the trunk, giving wood its well-known grain. 
Typically, these fluid-conveying cells are 0.5-5 mm 
long and have an aspect ratio of 100:1. In temperate 
zones of climate, new layers of cells grow beneath the 
bark during spring (large cells, thin walls) and summer 
(small cells, thick walls). Other, more elongated cells, 
known as ray cells, form at right angles to the tracheids 
and carry fluids transversely across the trunk. Growth 
ceases during winter, giving rise to the visible ‘growth 
rings’ in the trunk’s cross-section. Sap diffuses up 
the outer annulus of sapwood while the heartwood 
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provides a supportive core. According to botanical 
classification, broad-leaved trees provide hardwoods 
(ash, oak, elm, maple, balsa, etc.) and coniferous trees 
with needle-shaped leaves provide softwoods (pine, 
spruce, fir, etc.). 

Differences in the wall structure, shape and size 
of the cells are responsible for the many different 
types of wood. Within the strong and elastic wall of 
each tracheid, different types of layers of cellulose 
microfibrils have been identified (Figure 14.1). In the 
primary (P) layer, microfibrils are randomly orientated. 
In the three secondary (S) layers, microfibril orienta- 
tion varies from layer to layer. Thus, in the thick, mid- 
dle S2 layer, the alignment of microfibrils at a slight 
angle to the major axis of the cell is mainly responsible 
for the high tensile strength along the grain. The cells 
are cemented together, between their primary walls, 
by a non-crystalline phase which is rich in lignin. This 
middle lamina (ML) is weaker than the cellulose lay- 
ers. The overall density of wood depends upon cell 
structure, the ratio of dense lignin to cellulose and 
the amount of water absorbed. Moisture modifies the 
layer structure of the cell walls and thereby drastically 
reduces strength and elasticity. 

Because of the fibrous, highly anisotropic structure 
of wood, control of dimensional stability is a potential 
problem. Typically, coefficients of thermal expansion 
for tangential, radial and longitudinal directions in a 
tree trunk are in the order of 70 x 10 -6 , 50 x 10 -6 and 
3 x 10 -6 C -1 , respectively. When felled, timber can 
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Figure 14.1 Schematic representation of cell wall structure 
in wood, showing various layers and their differing 
orientations of cellulose microfibrils. M.L. = middle layer, 
P = primary wall, SI = outer layer of secondary wall, 

S2 = middle layer of secondary wall, S3 = innermost layer 
of secondary wall, HT = helical thickening, W = warty 
layer (from Butterfield and Meylan, 1980: by permission of 
Kluwer Academic Publishers, Norwell, MA, USA). 


contain more than its own weight of water, i.e. > 100% 
moisture. As moisture is lost from within cells and 
from their walls during either air drying or kiln dry- 
ing, shrinkage occurs. Dimensional stability in service 
is achieved by aiming for an equilibrium between the 
moisture content of the wood and atmospheric humid- 
ity. Thus, 10% moisture content (kiln dried) might be 
specified for wood used in a centrally heated build- 
ing whereas 20% moisture (air dried) might be more 
appropriate for outdoor service. 

The mechanical strength of wood increases roughly 
in proportion to density. Longitudinal tensile strength 
with the grain is usually several times greater than ten- 
sile strength across the grain: specific tensile strength 
is generally excellent and comparable to that of mild 
steel. Compressive stress along the grain can cause 
the fibrous structure to buckle: compressive strength is 
therefore not exceptional. In the composite known as 
plywood, thin layers of wood are glued together with 
adjacent grains crossed, giving uniformity of strength 
and minimizing shrinkage. 

Continual stressing can cause structural creep of 
wood; accordingly, archery bows are unstrung when 
not in use. Wood has good fracture toughness, being 
able to withstand the localized damage produced by 
screws and nails. Although wood has often been dis- 
placed from its dominant role, certain sports still take 
advantage of its unique characteristics. The excellent 
damping capacity of certain woods, together with their 
elasticity, has long favoured their use for bats and 
clubs, e.g. golf club heads, hockey sticks, ash hurleys, 
table tennis bats. A classic example was the choice 
of supple blue willow ( Salix alba caerulea), with its 
thick- walled cell structure, for the English cricket bat. 


14.3 Tennis rackets 

14.3.1 Frames for tennis rackets 

Rackets utilizing the latest engineering materials have 
increased the capacity of the professional player to 
make strokes over a remarkable range of pace and 
spin. At an amateur level, their introduction has made 
the sport more accessible by providing a greater tol- 
erance for error and by easing beginners’ problems. 
For many years, hardwoods, such as ash, maple and 
okume, were used in the laminated construction of 
racket frames. The standard area was 70 in 2 . Although 
damping capacity was excellent, their temperature- 
and humidity-dependent behaviour could be awkward: 
wear and warping problems were commonly accepted. 
The introduction of tubular- section aluminium alloy 
and steel frames in the 1960s enabled designers to 
break through the barriers imposed by the use of wood, 
and tennis boomed. In the 1970s, ‘large-head’ rack- 
ets of different shape with a larger area of 105 in 2 
came into vogue. The greater stiffness and strength 
enabled string tensions to be raised and the effective 
playing area, the so-called ‘sweet spot’, was consider- 
ably increased. The ‘sweet spot’ is the central portion 
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of the stringed area where the vibrational nodes are 
minimal, or zero, when the ball is struck. A mishit of 
the ball outside this area generates undesirable vibra- 
tions and, quite apart from spoiling the accuracy and 
power of the subsequent shot, can cause the player to 
develop the painful and debilitating muscular condition 
known as ‘tennis elbow’. Despite their merits, alloy 
frames had insufficient damping capacity and failed to 
displace wooden frames. 

In the next stage of evolution, during the 1970s, 
composite frames were developed which used 
polyester, epoxy or phenolic resin matrices in a 
variety of combinations with continuous fibres of E- 
glass, carbon and aramid ( Kevlar ); although more 
costly than alloy frames, they had better damping 
capacity. ‘Mid-size’ composite rackets with a strung 
area of 80-100 in 2 came into favour. The generally 
accepted manufacturing method was to lay-up the 
continuous fibres in a frame mould, surround them 
with thermosetting resin, compression mould, cure, 
surface finish and drill string holes. 

These ‘conventional’ composite frames were costly 
to produce and used material that was twice as dense 
as a laminated wood frame. In 1980, the world market 
in tennis rackets was 55% wood, 30% composite and 
15% metal. In the same year, injection moulding of 
hollow ‘mid-size’ frames (Figure 14.2) was pioneered 
(Haines et al., 1983). This method directly challenged 
the ‘conventional’ manufacturing route for composites. 
In the original patented process, a composite mix of 
30% v/v short carbon fibres (PAN) and polyamide 
thermoplastic (nylon 66) was injected into a complex 
frame mould carrying an accurately located central 
core. (Rheological studies have shown that, at high 
shear rates, the viscosity of the fibre-laden mixture 


is not greatly different from that of thermoplastic 
alone.) The fusible core was made of Bi-Sn alloy, 
m.p. 138.5°C, some 130°C lower than the melting 
point of the polymeric matrix phase. (Being thermally 
conductive, the core acted as a heatsink and remained 
intact.) The core was slowly melted out in an oil bath 
(150°C) and later reused, leaving a central cavity in the 
frame which could be filled with vibration-damping, 
low-density polyurethane foam. (The melt-out process 
also had the beneficial effect of relieving stresses in 
the frame.) The mould/core design allowed the flowing 
composite to form strong tubular pillars in the frame 
through which the strings could be threaded. The wall 
thickness of the frame was 2.5 mm. The cycle time for 
core preparation, injection moulding and core melt-out 
was only 3 minutes per frame. The balanced handle 
was given a leather grip and filled with medium- 
density polyurethane foam. Injection-moulded frames 
of this type, using a short-fibre composite, were much 
stronger than laminated wood frames and as strong 
as, often stronger than, ‘conventional’ continuous-fibre 
composite frames. 1 

14.3.2 Strings for tennis rackets 

Significant string properties include retention of elas- 
ticity and tension, impact efficiency, directional control 
of the ball, resistance to ultraviolet radiation, gamma 
radiation, abrasion, moisture, creep, chemical agents, 
etc. String tension is a subject of prime importance to 

'For the racket and manufacturing process, Dunlop Sports 
Co. Ltd, UK, won the 1981 Design Council Award. This 
racket type was used by champions Steffi Graf and John 
McEnroe. 
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Figure 14.2 Injection-moulded frame and handle of tennis racket (from Haines et al., 1983: by permission of the Council of 
the Institution of Mechanical Engineers ). 
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tennis players and, because of its intrinsic time depen- 
dence, much study and debate. It is contended that 
some ready- strung rackets, as purchased, are already 
slack. 

Ox-gut, the traditional string material, is very effec- 
tive. However, a greater rebound velocity can be 
imparted to the ball by using synthetic materials of 
higher elastic modulus, such as nylon. Each nylon 
string, typically 1.3 mm diameter, is a braid composed 
of plaited and intertwined bundles of strong fibres. 
After immersion in an elastomeric solution, the string 
is given a polymeric coating to protect it from moisture 
and wear. Rebound velocity is, of course, also directly 
influenced by string tension, which, depending upon 
racket type and personal preference, can range from a 
force of 200 N to more than 300 N. Another promis- 
ing material for strings, initially supplied in the form of 
yam (Zyex, Viet rex), is polyether-etherketone (PEEK). 

14.4 Golf clubs 

14.4.1 Kinetic aspects of a golf stroke 

Consider the violent but eventful history of an effective 
drive with a golf club (Horwood, 1994). Figure 14.3 
shows how the bending moment of the shaft varies 
with time during the 2 second period of the stroke. (A 
bending moment of 1 N m is equivalent to a deflection 
of about 13 mm at the end of the shaft.) In this 
particular test, toward the end of its downward swing 
and prior to impact, the shaft deflected backwards and 
then forwards, the club head reaching a swing speed 
at impact of 42.5 m s -1 (95 miles h -1 ). The period 
of contact between head and ball was approximately 
0.5 ms. After a well-executed stroke, the driven golf 
ball, aided aerodynamic ally by its surface dimples, is 


capable of travelling a distance of some 220-240 m. 
The shaft of a club bends and twists elastically during 
the complete swing and, after impact, vibrations travel 
at the speed of sound along the shaft toward the 
grip. The much-quoted ‘feel’ of a club tends to be a 
highly subjective judgement; for instance, in addition 
to transmitted vibrations, it usually takes into especial 
account the sound heard after the instant of impact. 
A mishit occurs if the striking face of the head is 
imperfectly aligned, horizontally and vertically, with 
the ball; a fair proportion of drives fall into this 
category. The desired ‘sweet spot’ of impact lies at the 
point where a line projected from the centre of gravity 
of the head meets the striking face perpendicularly. 
Off-centre impact (outside the ‘sweet spot’) rotates the 
large head of a ‘wood’ about its centre of gravity, 
causing the ball to either hook sidespin (toed shot) 
or slice sidespin (heeled shot) in accordance with the 
well-known ‘gear effect’. 

The intrinsic difficulties of achieving distance, accu- 
racy and consistency have challenged golf players for 
centuries and continue to do so. One beneficial effect 
of the new materials recently adopted for club heads 
and shafts has been to make this demanding sport 
accessible to a wider range of player ability e.g. senior 
citizens. Despite golf being frequently described as a 
triumph of art over science, concerted efforts are being 
made to explain and rationalize its unique physical 
aspects in engineering and scientific terms (Cochran, 
1994). Such research activities, which are commer- 
cially stimulated, have led to the realization that many 
phenomena attending the impact of club upon ball are 
still imperfectly understood. One consequence is that, 
at a practical level, equipment makers apply a variety 
of testing methods that are often unco-ordinated and 
potentially confusing: there appears to be a need for 
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Figure 14.3 Shaft bending in typical swing of golf club (after Horwood, 1994). 
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standard testing procedures for shafts, club heads, etc. 
to be established internationally. 

14.4.2 Golf club shafts 

The principal design parameters for a shaft are weight, 
bending stiffness, bend point and torsional stiffness. 
For a given head weight, lightening the shaft reputedly 
makes the swing faster and hence gives some increase 
in ball speed and distance. Figure 14.3 has already 
indicated the violent forward and backward bending 
that occurs in the plane of swing during a drive. 
Professional golfers and players with a rapid, compact 
swing favour very stiff shafts. The bend point (also 
called the flex point or kick point) of a shaft is 
generally taken as the region where the minimum 
radius of curvature occurs during bending. A butt- 
flexible shaft has a high bend point (near the grip) and 
suits long hitters. Conversely, a tip-flexible shaft has a 
low bend point (near the head) and suits weaker hitters. 
A low bend point also increases the dynamic loft of 
the struck ball. During a drive shot, the shaft tends 
to twist a few degrees (as well as bend) because the 
centre of gravity of the club head and the long axis of 
the shaft are offset. Torsional stiffness (wrongly called 
‘torque’) helps the striking face to remain ‘square’ 
during impact. Long hitters tend to favour torsionally 
stiff (Tow torque’) shafts. 

A typical driver, total weight 350 g, comprises 
a grip (50 g), steel shaft (100 g) and head (200 g). 
A variety of materials, including alloy steel, alu- 
minium alloy (7075), Ti-6A1-4V alloy and com- 
posites, have been used for the hollow shafts. Car- 
bon, Kevlar , glass, boron and silicon carbide fibres 
have been used as ‘reinforcement’ in shaft compos- 
ites. Both epoxy and alloy matrices have been used 
e.g. 7075 aluminium alloy + 17%v/v short SiC fibres. 
The main advantages of hollow CFRP shafts, loosely 
termed ‘graphite’ shafts, is their relative lightness 
(about 60 g) and high damping capacity. They are 
made by (i) wrapping prepreg sheets of carbon fibre 
around tapered mandrels, (ii) filament winding around 
a mandrel or (iii) resin transfer moulding (RTM) in 
which resin is forced around a carbon fibre sleeve. 
Reproducing exactly the same bending characteristics 
from shaft to shaft is a difficult, labour-intensive task 
and demands considerable care and skill. Quality was 
not always assured with the early CFRP shafts. With 
CFRP, torsional stiffness and the location of the bend 
point can be manipulated by varying the textural form 
and lay-up of the graphite filaments. In this respect, 
steel shafts, which represent the principal competition 
to the more recently introduced CFRP shafts, are more 
restricted. However, they can match CFRP for specific 
stiffness and strength. Paradoxically, irons with ‘heavy 
graphite’ shafts are now available: addition of expen- 
sive boron fibres to the CFRP near the shaft tip allows 
a lighter club head to be used. The net effect is to allow 
the balance point of the club to be shifted 40-50 mm 
higher up the shaft and away from the club head. This 


device seeks to retain the desired damping capacity of 
CFRP while giving an overall balance similar to that 
of a steel shaft. 

The main categories of golf club are (i) wood- 
type clubs (‘woods’), comprising wooden ‘woods’ 
and metal ‘woods’, (ii) iron-type clubs (‘irons’) and 
(iii) putters. Materials used for club heads range 
from hardwoods to alloys (stainless steel, titanium 
alloy, aluminium alloy, copper alloy) and composites 
(CFRP). There has been a tendency for club heads to 
get larger and for shafts to get longer. 

14.4.3 Wood-type club heads 

‘Woods’ have the bulkiest shape and the largest front 
to back dimension. The traditional wooden ‘wood’ 
club heads, which are still highly regarded, are made 
from either persimmon (date-plum tree) or maple (lam- 
inated). These two hardwoods have similar cell struc- 
ture, density and hardness. Their capacity for damping 
vibrations is excellent. Manufacture involves 120-200 
manual operations and is skill demanding. In a prelimi- 
nary curing process, the wood structure is impregnated 
with linseed oil in order to make it waterproof and 
hard. A striking face and a sole plate, both metal- 
lic, are inserted and a central cavity filled with cork. 
The head is protectively coated with polyurethane and 
finally weighs about 200 g. As with all club heads, 
the principal design variables are size, shape, loca- 
tion of the centre of gravity and mass distribution. For 
instance, the cork insert helps to displace mass to the 
outside of the head to give some degree of ‘peripheral 
weighting’, a feature which reduces the twisting action 
of off-centre shots i.e. outside the ‘sweet spot’. 

In the 1970s, metallic ‘woods’ made from alloys 
became available as drivers. Materials ranged from 
stainless steel to the light alloys of aluminium and 
titanium. Nowadays 17Cr-4Ni stainless steel is a pop- 
ular choice. Compared to traditional wooden ‘woods’, 
alloy heads are easier to make and repeatable quality 
is more easily achieved, largely because of the intro- 
duction of investment casting. In this modern version 
of the ancient Tost wax’ process, which is eminently 
suitable for high m.p. alloys that cannot be diecast, a 
thin refractory shell is formed around a wax pattern. 
The wax is melted to leave a cavity which serves as 
a detailed mould for molten steel. The precision cast- 
ings are welded to form a thin-walled, hollow shell. 
The central cavity is usually filled with low-density 
foam. Compared with wooden ‘woods’, the moment 
of inertia is greater and the centre of gravity is located 
closer to the striking face. The shell construction gives 
peripheral weight distribution away from the centre 
of gravity of the head. In a recent design the cen- 
tre of gravity has been lowered by locating dense 
Cu/W inserts in the sole, e.g. Trimetal clubs. Over- 
all, such innovative features confer a greater tolerance 
for mishits. For instance, a greater moment of inertia 
reduces twisting of the head when impact is off-centre 
and also reduces spin. 
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Composite ‘woods’ have been made from CFRP. 
These heads, which emphasize lightness and strength, 
are similar in shape and style to classic wooden heads 
and have a similar placing of the centre of gravity. Typ- 
ically, they have wear-resistant alloy sole plates and a 
foam-filled core. Compression moulding or injection 
moulding are used in their manufacture. 


14.4.4 Iron-type club heads 

The relatively narrow heads for ‘irons’ are usually 
made from steels and copper alloys which are shaped 
by either hot forging or investment casting. Stainless 
materials include 17Cr-4Ni and AISI Types 431 and 
304. As an alternative to the traditional blade-type 
head, ‘cavity-back irons’ provide peripheral weighting. 
In a recent innovation, a non-crystalline zirconium- 
based alloy 2 containing Cu, Ti, Ni and Be has been 
used for the heads of irons (and putters). This alloy 
has high specific strength and good damping capacity 
and can be successfully vacuum cast in a glassy state 
without the need for ultra-fast freezing rates. 


14.4.5 Putting heads 

Although a set of golf clubs may only contain one 
putter, it is typically used for 40-50% of the strokes 
in a game. The dynamic demands are less than those 
for ‘woods’ and ‘irons’, consequently putter designs 
have tended to be less innovative. Design parameters 
include ‘sweet spot’, weight distribution, bending stiff- 
ness, etc., as previously. Alloys used include stainless 
steel (17Cr-4Ni), manganese ‘bronze’ (Cu-Zn-Mn) 
and beryllium- copper (Cu-2Be). 

14.5 Archery bows and arrows 

14.5.1 The longbow 

For centuries, archery bows have combined design 
skill with knowledge of material properties. From 
the evidence of many well-preserved yew longbows 
retrieved in 1982 from the wreck of the Tudor war- 
ship ‘ Mary Rose’, which sank in Portsmouth harbour 
(1545), we know that the original seasoned stave was 
shaped in such a way as to locate sapwood on the outer 
convex ‘back’ of the bow and darker heartwood at the 
concave ‘belly’ surface. When the bow was braced 
and drawn, this natural composite arrangement gave 
the greatest resistance to the corresponding tensile and 
compressive stresses. Considerable force, estimated to 
be in the order of 36-72 kgf (80-160 lbf), was needed 

2 Vitreloy or Liquidmetal, developed at Caltech, now 

produced by Howmet Corp. Greenwich, CT-06830, USA. 


to draw a heavy longbow. 3 The mystique of the long- 
bow and its near-optimum design have intrigued engi- 
neers and scientists; their studies have greatly helped 
in providing a theoretical basis for modern designs of 
bows and arrows (Blyth & Pratt, 1976). 

14.5.2 Bow design 

A bow and its arrows should be matched to the strength 
and length of the archer’s arm. A well-designed bow 
acts as a powerful spring and transfers stored strain 
energy smoothly and efficiently to the arrow. As the 
archer applies force and draws the bowstring from 
the braced condition (which already stores energy) 
through a draw distance of about 35 cm, additional 
energy is stored in the two limbs (arms) of the bow. In 
general, increasing the length of the bow reduces stress 
and increases the potential for energy storage. Upon 
release of the bowstring, stored energy accelerates the 
arrow as well as the string and the two limbs of the 
bow. The efficiency (p) of the bow at the moment 
of loose may be taken simply as the kinetic energy 
of the arrow divided by work expended in drawing 
the bow. Alternatively, allowance can be made for the 
energy-absorbing movement of the two limbs, as in 
the Klopsteg formula: 

Yj = m/k + m (14.1) 

where m is the weight of the arrow and k is the ‘virtual 
weight’ of the particular bow. The constant term k 
treats energy losses in the bow as an extra burden on 
the driven arrow, travelling with the same velocity. 
For a given bow and draw force, p increases with 
arrow mass. Thus, for a certain yew bow ( k = 23.5 g), 
increasing the arrow weight from 23.5 g to 70.5 g 
increased the efficiency of the bow from 50% to 75%. 

Bow materials are often compared in terms of spe- 
cific modulus of rupture and specific modulus of elas- 
ticity. Thus, for wooden bows, timbers which combine 
a high MoR/p with a comparatively low E/p are gen- 
erally preferred as they provide lightness, the necessary 
resistance to bending stresses and a capacity to store 
energy. The fine-grained hardwoods ash and wych elm 
meet these criteria. Although nominally a softwood, 
yew was favoured for longbows, its very fine grain 
giving remarkable bending strength. Strain energy per 
unit volume can be derived from the stress v. strain 
diagram and expressed as 0.5 s 2 E. It follows that max- 
imizing strain s (below the elastic limit) is an effective 
way of maximizing stored energy. 

There are two main types of modern bow, the stan- 
dard recurve (Olympic) bow and the more complex 
compound bow. In contrast to the D- section of the 

3 Skeletal remains of an archer, taken from the same 
shipwreck, indicate that a lifetime of drawing the longbow 
produced permanent physical deformation. This powerful 
weapon was developed in conflicts in the Welsh Marches; 
its ability to penetrate plate armour had both military and 
social significance. 
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Figure 14.4 Modem competition bow, compound-type: laminated upper limb (wood, gfrp, cfrp ) and CNC-machined central 
riser (Al-Mg-Si alloy 6082) (courtesy of Merlin Bows, Loughborough, U.K.). 


Tudor longbow, limb sections of a recurve bow are 
wide, flat and thin, giving resistance to twisting. The 
energy efficiency of a compound bow (Figure 14.4) is 
twice that of the longbow and, as a consequence, can 
propel an arrow much faster, at velocities of 90 m s ' 1 
or more. Pulley cams at the ends of the two limbs sus- 
tain the load at full draw during the sighting period 
of 10 s or more. The most powerful compound bows 
use light alloys for the mid-section (‘riser’ or grip) e.g. 
forged A1 alloy, diecast Mg alloy. Laminated wood is 
used for some bow grips, e.g. maple plus rosewood. 
The two limbs are very often laminated in construction 
and are much less susceptible to temperature change 
and humidity than wood alone. Many different material 
combinations are used for laminae, e.g. CFRP, wood, 
GRP, foam, etc. For instance, in one type of compos- 
ite bow limb, facing and backing strips of GRP are 
joined to each side of a thin core strip of maple with 
epoxy adhesive. Tubular alloy steel limbs have been 
superseded as they were prone to internal aqueous cor- 
rosion: sudden fracture of a drawn bow (or its string) 
can be extremely dangerous. 

14.5.3 Arrow design 

Successful discharge of an arrow from a bow involves 
a careful balancing of three arrow characteristics; 
namely, length, mass and stiffness (‘spine’). Subse- 
quent flight depends on the aerodynamic qualities of 
the design of head, shaft and fletching. The length of 
the arrow is determined by the geometry of the human 
body; typically, lengths range from 71 to 76 cm. The 
mass chosen depends initially upon the type of archery, 
e.g. maximum range, target shooting, etc. The prod- 
uct of efficiency ( 37 ) and stored energy (. E s ) gives the 
kinetic energy of the arrow, hence: 

rjE s = 0.5 mv\ (14.2) 

where m = mass of the arrow and v 0 = its initial 
velocity. Thus velocity increases with bow efficiency 
and decreasing arrow mass. 


Finally, an arrow must possess an optimum, rather 
than maximum, stiffness (‘spine’) which must be 
matched to the bow. Bending stiffness of an arrowshaft 
is measured in a three-point bend test (Figure 7.6). 
Central to the design of an arrow is the phenomenon 
known as the Archer’s Paradox (Figure 14.5). At the 
loose, with the arrow pointing slightly away from the 
target, the arrow is subjected to a sudden compressive 
force along its length which, together with the deflect- 
ing action of the archer’s fingers, generates lateral 
vibrations in the moving arrow. Correct matching of 
the dimensions, stiffness and vibration characteristics 
(frequency, amplitude) of the arrow enables the arrow 
to clear the bow cleanly. In addition to dependence 
on the arrow’s dimensions, the frequency of flexu- 
ral vibration is proportional to the square root of the 
specific stiffness (E/p). Frequencies are in the order 
of 60 Hz. Reasonable agreement has been obtained 
between theory and high-speed cinephotographic stud- 
ies . 4 

Of the 15 kinds of wood used as arrows for long- 
bows in medieval times, ash was generally regarded as 
the best. Nowadays, arrowshafts are tubular and made 
from (i) drawn and anodized aluminium alloy (7075- 
T9, 7178-T9), (ii) similar alloys bonded to a smooth 
outer wrap of unidirectional CFRP and (iii) pultruded 
CFRP. (Early CFRP arrows were unpopular because 
they tended to develop splintering damage.) Most 
arrowshafts are constant in diameter along their length 
but have the disadvantage that their bending moment 
varies, increasing from zero at the ends to a maximum 
at the centre. Tapering (‘barrelling’) the tubular shaft 
from the middle to the ends reduces this undesirable 
flexing characteristic: barrelled arrows are used by top 
professionals. 

Feathers are the traditional fletching material and 
still used but are fragile and suffer from the weather. 
They rotate the arrow and give stability during flight 

4 At the Royal Armaments Research & Development 
Establishment (RARDE), UK. 
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Figure 14.5 The Archer’s Paradox ( after Pratt, 1976). 

but consume kinetic energy. Smooth polymeric vanes 
made from polyethylene terephthalate {Mylar) are 
strong, weather resistant and, because of their lower 
aerodynamic drag, give greater range; the same poly- 
mer is also commonly used in stranded form ( Dacron ) 
for bowstrings. 

14.6 Bicycles for sport 

14.6.1 Frame design 

The modem bicycle is a remarkable device for con- 
verting human energy into propulsion. The familiar 
diamond frame, with its head, top, seat and down 
tubes, evolved in the late nineteenth century. When 
in use, it distorts elastically; this compliance provides 
rider comfort. Compliance absorbs energy and frame 
stiffness is accordingly maximized in racing machines. 
The stress distribution in a working frame is complex, 
being in-plane as well as out-of-plane. Sudden impact 
stresses must be withstood. Poor design, workman- 
ship and/or maintenance can lead to component failure 
which, because of the fluctuating nature of stressing, 
often has fatigue characteristics. 

McMahon & Graham (1992) have provided a 
detailed comparison of typical tube materials for 
frames. Beam theory is used to identify the key 
design parameters. The basic linking formula which 
expresses the stresses and strains at points along a 


beam deflecting under load is: 

M/I = a/y = E/r (14.3) 

where M = bending moment, I = moment of inertia 
of beam section, o — stress at a point, y = distance 
of point from neutral axis of beam, E = modulus of 
elasticity and r = radius of curvature of loaded beam. 
In particular, we are concerned with (i) the maximum 
tensile stress <r max in the convex surface of a tube 
subjected to a nominal bending moment and (ii) the 
corresponding radius r of bending. Obviously, <r max 
should bear a good relation to the yield strength of the 
material and r should be maximized. 

A tubular cross-section offers special advantages. 
Within a bent beam, it locates as much material as 
possible in the highly stressed regions which lie distant 
from the neutral axis: this axis lies in the plane mark- 
ing the transition from tension to compression. Being 
symmetrical in section, a tube can be loaded trans- 
versely in any direction and can withstand torsion. Its 
moment of inertia is n(D 4 — d 4 )/ 64, where D and d 
are the outside and inside diameters of the tube, respec- 
tively. From relation (14.3) it can be seen that, for a 
given bending moment, increasing the moment of iner- 
tia reduces stress and increases the radius of curvature. 
In similar fashion, it can be reasoned that reducing the 
tube wall thickness increases the surface stress (a max ). 
Sometimes it is beneficial to raise the moment of iner- 
tia by changing from a circular cross-section to a more 
expensive elliptical cross-section. Thus, in front wheel 
forks, which are subjected to severe bending stresses, 
an increase in the major diameter of the ellipse reduces 
stress in the crucial plane. 

Table 14.1 compares the bending characteristics of 
tubes made from four typical materials used for cycle 
frames; that is, from plain carbon steel, 0.3C-Cr-Mo 
alloy steel (A1S1 4130), 6061 (T6) aluminium alloy 
and Ti-3A1 -2.5V alloy. Calculated values for bend 
curvature and maximum stress, which are the crite- 
ria of stiffness and permissible loading, are compared. 
The frames of mountain bicycles must sustain sudden 
impact shocks; accordingly, larger-diameter (D) and/or 
thicker-walled tubing is used for certain frame mem- 
bers in order to reduce stress levels. The specific elastic 
moduli (E/p) for the four materials are similar. Alu- 
minium alloy offers weight saving but, because of its 
relatively low E value (70 GN m -2 ), at the expense 
of greater flexure of the frame. Titanium alloy allows 
reductions in tube diameter and wall thickness; its spe- 
cific yield strength (cr y / p) is about two and a half times 
greater than that of Cr-Mo steel. 

Although not included in Table 14.1, cold-drawn 
seamless Mn-Mo tube steels have a special place in 
the history of competitive cycling, e.g. Reynolds 531. 
Their nominal composition is 0.25C-1.4Mn -0.2Mo. 
Introduced in the 1930s, they have been used for the 
frames of many Tour de France winners and are still 
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Table 14.1 Comparison of weight and bending characteristics of four metallic frame materials: r and cr max calculated 
for tubes subjected to a bending moment of 100 N m (from McMahon & Graham, 1992) 


Material 

D 

d 

Moment 

Mass per 

Radius of 

Maximum 

Gmax/ Gy 


(mm) 

(mm) 

of inertia 1 

unit length 

curvature r 

stress cr max 





(cm 4 ) 

(g mT 1 ) 

(mm) 

(MN m~ 2 ) 


Racing cycles 

C steel 

28.70 

26.16 

1.041 

860 

48 

138 

0.58 

Cr-Mo steel 

28.78 

26.93 

0.750 

601 

67 

190 

0.39 

Al alloy 
(6061-T6) 

28.80 

25.91 

1.124 

332 

127 

128 

0.50 

Mountain cycles ( Top tube ) 







Al alloy 

38.10 

35.56 

2.50 

397 

57 

76 

0.30 

(6061-T6) 

Ditto 

34.93 

30.81 

2.87 

575 

49 

61 

0.24 

Ti-3A1-2.5V 

31.75 

29.47 

1.41 

549 

64 

45 

0.18 


widely used for racing cycles. 5 

14.6.2 Joining techniques for metallic frames 

The above guidelines provide a general perspective 
but do not allow for the potentially weakening effect 
of the thermal processes used for joining the ends 
of individual frame tubes. Such joints often coincide 
with the highest bending moments. In mass produc- 
tion, the cold-drawn low-carbon steel tubes of standard 
bicycle frames are joined by brazing. Shaped rein- 
forcing sockets (lugs) of low -carbon steel, together 
with thin inserts of solid brazing alloy, are placed 
around the tube ends, suitably supported, and heated. 
A 60Cu-40Zn alloy such as CZ7A (British Stan- 
dard 1845) freezes over the approximate temperature 
range of 900-870°C as the frame cools and forms a 
strong, sufficiently ductile mixture of a and /3 phases 
(Figure 3.20). Butted tubes are commonly used to 
counteract softening of the steel in the heat-affected 
zones (HAZ); they have a smaller inside diameter 
(< d ) toward the tube ends. For limited production runs 
of specialized racing frames made from butted alloy 
tubes, fillet brazing with an oxy-acetylene torch at a 
lower temperature is more appropriate, using a silver 
brazing alloy selected from the AG series of British 
Standard 1845, such as 50Ag-15Cu-16Zn -19Cd 
(melting range 620-640°C). Cadmium-free alloys are 
advocated if efficient fume-extraction facilities are not 
available because CdO fumes are dangerous to health. 

Tungsten-inert gas (TIG) welding 6 is widely used 
and has tended to replace brazing, e.g. lugless Cr-Mo 
steel frames for mountain bicycles. Unlike oxy acety- 
lene flames, heating is intense and very localized. The 
hardenability of Cr-Mo steels is such that a strong 

5 ‘ 531 ’ tubes were used for the chassis of the jet-powered 
Thrust 2 vehicle in which Richard Noble broke the one-mile 
land speed record (1983), achieving a speed of 1019 km h -1 

6 Patented in the 1930s in the USA., where argon and 

helium were available, this fluxless arc process is widely 
used for stainless steels and alloys of Al, Ti, Mg, Ni and Zr. 


mixture of dispersed alloy carbides, pearlite and possi- 
bly bainite forms in the weld fillet as they air cool from 
temperatures above 850°C. The latest type of low- 
alloy steel for frames, available in either cold-drawn or 
heat-treated condition ( Reynolds 631 and 853), is air 
hardening. Although inherently very hard (400 VPN), 
TIG-welding increases its hardness in the HAZ. It pos- 
sesses better fatigue resistance than other alloy steels 
and its strength/weight ratio makes it competitive with 
Ti-3A1-2.5V alloy and composites. 

Aluminium alloy tubes, which are solution treated 
and artificially aged (T6 condition), present a problem 
because heating during joining overages and softens 
the structure, e.g. 6061, 7005. The high thermal con- 
ductivity of aluminium worsens the problem. Titanium 
alloys, such as the frame alloy Ti-3A1 -2.5V, absorb 
gases and become embrittled when heated in air, e.g. 
oxygen, nitrogen, hydrogen. Again, it is essential to 
prevent this absorption by shrouding the weld pool 
with a flowing atmosphere of inert gas (argon). 

14.6.3 Frame assembly using epoxy adhesives 

These joining problems encouraged a move toward 
the use of epoxy adhesives with sleeved tube joints. 7 
As well as helping to eliminate the HAZ problem, 
adhesives make it possible to construct hybrid frames 
from various combinations of dissimilar materials 
(adherends), including composites. Brake assemblies 
can be glued to CFRP forks. Adhesive bonds also 
damp vibrations, save weight, reduce assembly costs 
and are durable. Extremes of humidity and tempera- 
ture can cause problems and care is essential during 
adhesive selection. Adhesives technology meets the 

7 Adhesive-bonded racing cycles, sponsored by Raleigh 
Cycles of America, were highly successful in the 1984 
Olympic Games. Subsequently, Raleigh made mountain 
cycles from aluminium alloy tubes (6061-T8) bonded with 
Permabond single-part ESP -3 11 epoxy adhesive. 
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stringent demands of modern aircraft manufacturers 8 
and makes a vital contribution throughout the world 
of sport. 

The structural adhesives most widely used in gen- 
eral engineering are the epoxy resins; their ther- 
mosetting character has been described previously 
(Section 2.7.3). Normally they are water resistant. 
They form strong bonds but, being in a glassy state, are 
brittle. Accordingly, thermoplastic and/or elastomeric 
constituents are sometimes included with the ther- 
mosetting component. When using the two-part ver- 
sion of a thermosetting adhesive, it is important to 
control the proportions of basic resinous binder and 
catalytic agent (hardener) exactly, to mix thoroughly 
and to allow adequate time for curing. In single-part 
epoxy adhesives the resin and hardener are pre-mixed: 
rapid curing is initiated by raising the temperature 
above 100°C. Thermoplastic adhesives, used alone, are 
weaker, more heat sensitive and less creep resistant. 
Elastomeric adhesives, based on synthetic rubbers, 
are inherently weak. Meticulous preparation of the 
adherend surfaces is essential for all types of adhesive. 

14.6.4 Composite frames 

Epoxy resins are also used to provide the 
matrix phase in the hollow, composite frames of 
high-performance bicycles. Carbon fibre reinforced 
polymers (Section 11.3.2.1) combine high strength and 
stiffness; their introduction facilitated the construction 
of monocoque (single shell) frames and led to the 
appearance of a remarkable generation of record- 
breaking machines. 9 Typically, they feature a daring 
cantilevered seat, a disc rear wheel and three-spoke 
open front wheels, all of which are made from CFRP. 
An example is depicted in Figure 14.6. 

14.6.5 Bicycle wheels 

The familiar array of wire spokes between axle and rim 
normally uses hard-drawn wire of either plain 0.4% 
carbon steel (AISI 1040) or austenitic 18Cr-8Ni stain- 
less steel (McMahon & Graham, 1992). Each spoke 
is tangential to the axle, thus preventing ‘wind-up’ 
displacement between axle and rim, and is elastically 
pretensioned (e.g. 440 MN m -2 ) so that it is always in 
tension during service. During each wheel revolution, 
the stress on a given spoke is mostly above the preten- 
sion stress, falling once below it. Under these cyclic 
conditions, carbon steel has a greater nominal fatigue 
endurance than the corrosion-resistant 18/8 steel but 

8 Urea-formaldehyde resins ( Beetle cements) revolutionized 
aircraft building in the 1940s when they were used for 
bonding and gap-filling functions with birchwood/balsa 
composites and spruce airframes e.g. De Havilland 
Mosquito, Airspeed Horsa gliders. 

9 The prototype was the Lotus bicycle on which Chris 
Boardman won the 4000 m individual pursuit in the 1992 
Olympic Games at Barcelona. 



Figure 14.6 High-performance Zipp bicycle with 
monocoque frame ( courtesy of Julian Ormandy, School of 
Metallurgy and Materials, University of Birmingham, UK). 

has a smaller resistance to corrosion fatigue. The latter 
property is boosted by plating the carbon steel with a 
sacrificial layer of zinc or cadmium. Both types of steel 
respond well to the strain-hardening action of wire 
drawing through tungsten carbide dies. Wheel rims 
should be strong, stiff, light and corrosion resistant. 
They are often formed by bending strips of extruded, 
precipitation-hardenable aluminium alloy to shape and 
joining e.g. 6061-T6. 

Conventional multi- spoked wheels generate energy - 
absorbing turbulence during rotation. The distinctive 
CFRP front and rear wheels of highly specialized time- 
trial machines, made by such firms as Lotus, Zipp, 
RMIT-AIS and Ultimate Bike, have a much lower 
aerodynamic coefficient of drag. They are the prod- 
ucts of extensive computer-aided design programmes, 
wind tunnel simulations and instrumented performance 
testing. 


14.7 Fencing foils 

A typical steel foil is about 0.9 m long and tapers to 
a rectangular cross-section of 4 mm x 3 mm. This 
design gives a low resistance to buckling under the 
large axial stress produced when an opponent is struck 
directly, an action which can bend the foil forcibly 
into a radius as small as 20 cm. Traditionally, sword- 
makers use medium-carbon alloy steels of the type 
employed in engineering for springs. The extensive 
range of elastic behaviour that is associated with a 
high yield strength is obviously desirable. The foil is 
formed by hot working 10 mm square bar stock and 
then oil quenching and tempering to develop a marten- 
sitic structure with a yield strength in the order of 
1500-1700 MN ncT 2 . On occasion, during a fencing 
bout, the applied stress exceeds the yield strength and 
the foil deforms plastically: provided that the foil is 
defect free, the fencer can restore straightness by care- 
ful reverse bending. In practice, however, used foils 
are not defect free. During bouts, repeated blows from 
the opposing blade produce small nicks in the surface 
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of a foil. In time, it is possible for one of these stress- 
raising notches to reach a critical size and to initiate 
fatigue cracking within the tempered martensite. Final 
failure occurs without warning and the buttoned foil 
instantly becomes a deadly weapon. 

One research response to this problem was to con- 
centrate upon improving fracture toughness and resis- 
tance to fatigue failure, thus eliminating instantaneity 
of failure. 10 In this alternative material, a steel- steel 
composite, lightly tempered fibres of martensite are 
aligned within a continuous matrix phase of tough 
austenite. 10 mm square feedstock of duplex steel for 
the blade-forging machine is produced by diffusion 
annealing packs of nickel-electroplated bars of spring 
steel at a temperature of 1000°C, extruding and hot 
working. While the bars are at elevated temperatures, 
nickel interdiffuses with the underlying steel. Nickel is 
a notable austenite (y)-forming element, as indicated 
previously in Figure 9.2. The optimum volume frac- 
tion of tough austenite is about 5%. This duplex mate- 
rial has the same specific stiffness as the conventional 
steel and has greater fracture toughness. In the event 
of a surface nick initiating a crack in a longitudinal 
filament of brittle martensite, the crack passes rapidly 
across the filament and, upon encountering the tough 
interfilamentary austenite, abruptly changes direction 
and spreads parallel to the foil axis, absorbing energy 
as the austenite deforms plastically and new surfaces 
are formed. In practical terms, if the fencer should fail 
to notice marked changes in the handling character- 
istics of a deteriorating foil, the prolonged nature of 
final fracture is less likely to be dangerous and life 
threatening. Although safer, the duplex foil involves 
increased material-processing costs and has a yield 
strength about 5-10% lower than that of heat-treated 
spring steel. 

Highly alloyed maraging steels (Section 9.2.3) are 
used nowadays for top-level competition fencing. By 
combining solid solution strengthening with fine pre- 
cipitation in low-carbon martensite, they provide the 
desired high yield strength and fracture toughness. 

A typical composition is 0.03C (max)- 18 Ni-9 Co 
-5 Mo -0.7 Ti-0.1 Al. 

14.8 Materials for snow sports 

14.8.1 General requirements 

The previously quoted examples of equipment fre- 
quently share common material requirements and prop- 
erties, such as bending stiffness, yield strength, tough- 
ness, fatigue resistance, density and comfort. However, 
each sport makes its own unique demands on materials. 
In snowboarding and skiing equipment, for instance, 
additional requirements include toughness at sub zero 
temperatures (say down to — 30° C), low frictional drag 

10 Materials research conducted at Imperial College, 

London, on behalf of the fencing sword manufacturers, Paul 
Leon Equipment Co. Ltd, London (Baker, 1989). 


and resistance to prolonged contact with snow and 
moisture. More specifically, in cross-country (Nordic) 
skiing, lightness is very important as it makes striding 
less tiring. From a commercial aspect, it is desirable 
that materials for individual items of equipment should 
be able to display vivid, durable colours and designer 
logos. 

14.8.2 Snowboarding equipment 

The bindings which secure a snowboarder’ s boots to 
the top surface of the board are highly stressed dur- 
ing a downhill run. Good binding design provides a 
sensitive interaction between the board and the snow- 
boarder’s feet, facilitating jumps and turns. Modem 
designs are complex and usually employ a variety of 
polymers. Thus, the recent snowboard design shown in 
Figure 14.7 includes components made from an acetal 
homopolar ( Delrin ), a nylon-based polymer ( Zytel ) 
and a thermoplastic polyester elastomer ( Hytrel ). n 
Highly crystalline Delrin is tough, having a low glass- 
transition temperature (T g ), and strong and fatigue 
resistant. It is also suitably UV resistant and moisture 
resistant. Zytel is tough at low temperatures, can be 
moulded into complex shapes and can be stiffened by 
glass-fibre reinforcement. The third polymer, Hytrel , 
has properties intermediate to those of thermoplas- 
tics and elastomers, combining flexibility, strength and 
fatigue resistance. Both Hytrel and the nylon Zytrel 
can be fibre reinforced. Thus, some snowboard blades 
are made from Zytrel reinforced with fibres of either 
glass or aramid (Kevlar). Colourants mixed with the 
resins give attractive moulded-in colours. 



Figure 14.7 Snowboard binding utilizing: thermoplastic 
elastomer ( Hytrel ) — ankle strap A, spoiler B, ratchet strap 
G, nylon ( Zytrel ) — side frames D and H, base and disc F, 
top frame J; acetal homopolar ( Delrin ) — strap buckles Cl 
and C2 (courtesy of Fritschi Swiss Bindings AG and Du 
Pont UK Ltd). 

11 Delrin, Hytrel and Zytel are registered trademarks of 
DuPont. 
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Wood composites of 
hickory or ash 

Aluminium honeycomb 
or acrylic foam 
also used 

Glass fibre/polyester 
laminates 

Graphite or Kevlar or 
ceramic fibre 
stiffeners 
Ferritic stainless 
steel (or high strength 
aluminium alloy) 

Grooved low friction 
material, e.g. polyurethane (PU) 



ABS, Phenolics 


Figure 14.8 Transverse section showing multi- component structure of a downhill ski ( from Easterling, 1990 by permission of 
the Institute of Materials ) . 


14.8.3 Skiing equipment 

In the older sport of skiing, the principal items of 
equipment are the boots, bindings, skis and poles. 
External Hytrel- Kevlar components have been used 
to enhance the stiffness of ski boots; this feature gives 
the boots a firmer grip on the skier’s ankles and leads 
to better control of the skis. Polymers feature promi- 
nently in many design of ski bindings. For instance, 
in the Fritschi Diamir touring binding, 12 acetal poly- 
mer (Delrin) is used for the locking bar, heel release 
lever, heel block and front swivel plate while glass- 
reinforced nylon ( Zytel ) is used for the front block and 
the two base plates. 

Modern ski designs aim at solving the conflicting 
requirements of (i) longitudinal and torsional stiffness 
that will distribute the skier’s weight correctly and 
(ii) flexibility that will enable the ski to conform to 
irregularities in the snow contour (Easterling, 1993). 
Originally, each ski runner was made from a single 
piece of wood, e.g. hickory. Eaminated wood skis 
appeared in the 1930s. The adoption of polymers for 
ski components in the 1950s, combining lightness 
and resistance to degradation, was followed by the 
introduction of metal frames for downhill skis, e.g. 
alloy steel, aluminium alloy. By the 1960s, GRP and 
CFRP were coming into prominence. The internal 
structure of a ski is determined by the type of skiing 
and, as Figure 14.8 shows, often uses a surprising 
number of different materials. Skis usually have a 
shock-absorbing, cellular core that is natural (ash, 
hickory) and/or synthetic (aramid, aluminium, titanium 

12 Used by Hans Kammerlander in his 1996 ski descent of 
Mount Everest. 


or paper honeycomb). Polyethylene and polyurethane 
have been used for the soles of skis. 

Modem ski poles are tubular and designed to give 
high specific stiffness and good resistance to impacts. 
Nowadays, CFRP- GRP hybrids are favoured. The 
pointed tips are sometimes made of wear-resistant 
carbide. 

14.9 Safety helmets 

14.9.1 Function and form of safety helmets 

Most sports entail an element of personal risk. The 
main function of a safety helmet is to protect the 
human skull and its fragile contents by absorbing as 
much as possible of the kinetic energy that is vio- 
lently transferred during a collision. The three principal 
damaging consequences of sudden impact are fracture 
of the skull, linear acceleration of the brain relative 
to the skull, and rotational acceleration of the brain. 
Although linear and rotational acceleration may occur 
at the same time, many mechanical testing procedures 
for helmets concentrate upon linear acceleration and 
use it as a criterion of protection in specifications. 

A typical helmet consists of an outer shell and a 
foam liner. The shell is usually made from a strong, 
durable and rigid material that is capable of spreading 
and redistributing the impacting forces without suffer- 
ing brittle fracture. This reduction in pressure lessens 
the risk of skull fracture. The foam liner has a cel- 
lular structure that absorbs energy when crushed by 
impact. Specialized designs of helmets are used in 
cycling, horse riding, canoeing, mountaineering, ski- 
ing, skate boarding, ice hockey, etc. Some designs 
are quite rudimentary and offer minimal protection. 
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In general, the wearer expects the helmet to be com- 
fortable to wear, lightweight, not restrict peripheral 
vision unduly and be reasonably compact and/or aero- 
dynamic. Production costs should be low. Increasing 
the liner thickness is beneficial but, if the use of hel- 
mets is to be promoted, there are size constraints. Thus, 
for a cricket helmet, acceptable shell and liner thick- 
nesses are about 2-3 mm and 15 mm, respectively 
(Knowles et al., 1998). 

Strong and tough helmet shells have been produced 
from ABS and GRP. The great majority of shock- 
absorbent foam linings are made from polystyrene 
(Figure 14.9): polypropylene and polyurethane are 
also used. 

14.9.2 Mechanical behaviour of foams 

Polymeric foams provide an extremely useful class 
of engineering materials (Gibson & Ashby, 1988; 
Dyson, 1990). They can be readily produced in many 
different structural forms by a wide variety of methods 
using either physical or chemical blowing agents. Most 
thermoplastic and thermosetting resins can be foamed. 
The properties of a foam are a function of (i) the 
solid polymer’s characteristics, (ii) the relative density 
of the foam; that is, the ratio of the foam density 
to the density of the solid polymer forming the cell 
walls (p/p s X and (hi) the shape and size of the cells. 
Relative density is particularly important; a wide range 
is achievable (typically 0.05-0.2). Polymer foams are 
often anisotropic. Broadly speaking, two main types 
of structure are available: open-cell foams and closed- 
cell foams. Between these two structural extremes lies 
a host of intermediate forms. 

In the case of safety helmets, the ability of a 
liner foam to mitigate shock loading depends essen- 
tially upon its compression behaviour. Initially, under 
compressive stress, polymer foams deform in a lin- 
ear-elastic manner as cell walls bend and/or stretch. 



Figure 14.9 Cell structure of polystyrene foam, as used for 
shock-absorbent packaging: average cell diameter 100 pm 
(courtesy of Chris Hardy, School of Metallurgy and 
Materials, University of Birmingham, UK). 


With further increase in stress, cell walls buckle and 
collapse like overloaded struts; in this second stage, 
energy absorption is much more pronounced and defor- 
mation can be elastic or plastic, depending upon the 
particular polymer. If the cells are of the closed type, 
compression of the contained air makes an addi- 
tional and significant contribution to energy absorp- 
tion. Eventually the cell walls touch and stress rises 
sharply as the foam densifies. This condition occurs 
when a liner of inadequate thickness ‘bottoms out’ 
against the helmet shell. The design of a helmet liner 
should provide the desired energy absorption with- 
out ‘bottoming out’ and at the same time keep peak 
stresses below a prescribed limit. 

Some polymeric structures can recover their origi- 
nal form viscoelastically and withstand a number of 
heavy impacts; with others, a single impact can cause 
permanent damage to the cell structure, e.g. expanded 
PS. After serious impact, helmets with this type of 
liner should be destroyed. Although this requirement 
is impracticable in some sporting activities, there are 
cases where single-impact PS liners are considered to 
be adequate. 


14.9.3 Mechanical testing of safety helmets 

Various British Standards apply to protective helmets 
and caps for sports such as climbing (BS 4423), horse- 
and pony riding (BS EN 1384) and pedal cycling, 
skateboarding and rollerskating (BS EN 1078). These 
activities involve different hazards and accordingly the 
testing procedures and requirements for shock absorp- 
tion and resistance to penetration differ. In one typical 
form of test, a headform (simulating the mass and 
shape of the human head) is encased in a helmet 
and allowed to fall freely through a certain distance 
against a rigid anvil. Specified headform materials (BS 
EN 960) depend on the nature of the impact test and 
extend from laminated hardwood (beech) to alloys 
with a low resonance frequency (Mg-0.5Zr). A tri- 
axial accelerometer is affixed to the headform/helmet 
assembly in the zone of impact. The area beneath the 
curve of a continuous graphical record of striking force 
v. local deformation taken during the test provides 
a useful measure of the kinetic energy absorbed as 
the helmet structure is crushed. Specified values for 
permissible peak acceleration at impact which appear 
in test procedures vary but generally extend up to 
about 300 g, where g (acceleration due to gravity) = 
9.81 m s -2 . Drop heights range from about 1 to 2.5 m, 
depending upon the striking force required. Test spec- 
ifications often include requirements for helmets to be 
mechanically tested after exposure to extremes of tem- 
perature, ultraviolet radiation and water. On occasions, 
unsafe and/or inadequate helmets are marketed: natu- 
rally, closer international collaboration and regulation 
is being sought. 
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SI units 


The Systeme Internationale d’ Unites (SI) was intro- 
duced in the UK in the late 1960s. Historically, the 
SI can be traced from the metric enthusiasms of 
Napoleonic times, through a centimetre -gram (c.g.) 
system, a centimetre -gram- second (c.g.s.) system, a 
metre -kilogram- second (MKS) system in 1900 and 
a metre-kilogram- second- ampere (MKS A Giorgi) 
system in 1950. Table A1 lists the seven basic units 
and Table A2 lists the prefixes. 

The SI is ‘rational, comprehensive and coherent’. 
Coherency means that the product or quotient of basic 
units gives an appropriate derived unit of the resultant 
quantity. A coherent system facilitates manipulation of 
units, checking the dimensions of equations and, most 
importantly, the correlation of different disciplines. 
Some of the more frequently-used derived units are 
given in Table A3. 

The force unit, the newton, is the cornerstone of the 
SI. Appropriately, the gravitational attraction for an 
apple is roughly one newton. The SI unit of stress is 
N m -2 : the pascal (Pa) is an orphan, being non-SI and 
non-coherent. Energy is defined in mechanical terms, 
being the work done when the point of application of 
a force of 1 N is displaced through a distance of 1 m 
in the direction of the force. 


Table A1 


Quantity 

Unit 

Symbol 

Length 

metre 

m 

Mass 

kilogram 

kg 

Time 

second 

s 

Electric current 

ampere 

A 

Temperature 

degree Kelvin 

K 

Luminous intensity 

candela 

cd 

Amount of substance 

mole 

mol 


Table A2 


Factor 

Prefix 

Symbol 

10 12 

tera 

T 

10 9 

giga 

G 

10 6 

mega 

M 

10 3 

kilo 

k 

10 2a 

hecto a 

h a 

10 la 

deca a 

da a 

10- la 

deci a 

d a 

10“ 2a 

centi a 

c a 

10- 3 

milli 

m 

10“ 6 

micro 

[X 

10- 9 

nano 

n 

10- 12 

pico 

P 

10- 15 

femto 

f 

10- 18 

atto 

a 


a Discouraged 


Table A3 


Physical quantity 

SI unit 

Definition 
of unit 

Volume 

cubic metre 

m 3 

Force 

newton (N) 

kg m s -2 

Pressure, stress 

newton per 
square metre 

N m " 2 

Energy 

joule (J) 

N m 

Power 

watt (W) 

J s" 1 

Electric charge 

coulomb (C) 

A s 

Electric potential 

volt (V) 

W A" 1 

Electric resistance 

ohm (Q) 

V A -1 

Electric capacitance 

farad (F) 

A s V^ 1 

Frequency 

hertz (Hz) 

s _1 
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The surprising frequency with which SI units are 
misused in textbooks, learned papers, reports, theses 
and even examination papers, justifies a reminder of 
some rules: 

1. Try to locate basic units after the solidus: 1 MN/m 1 2 
preferred to 1 N/mm 2 . 

2. A space is significant: ms is not the same as m s. 

3. Prior to calculations, convert to basic SI units: 
1 mm becomes 1 x 10~ 3 4 m. 

4. Where possible, work in steps of 10 ±3 . 


5. If possible, group digits in threes and avoid com- 
mas: 37 532 rather than 37,532 because a comma 
means decimal point in some countries. 

6. When selecting a prefix, arrange for the preced- 
ing number to lie between 0.1 and 1000: use 
10 mm rather than 0.01 m. In a comparison, one 
may break this rule, e.g. ‘increase from 900 kN to 
12000 kN’. 

7. Do not use double prefixes: pF, not /x/xF. 

8. Avoid multiples of the solidus: acceleration written 
as m/s/s self-destructs. 



Appendix 2 

Conversion factors, constants and 
physical data 


Quantity 


Symbol 

Traditional units 

SI units 

1 atmosphere (pressure) 

atm 


101.325 kN m -2 

Avogadro constant 

N a 


0.602 x 10 24 mol _1 

1 Angstrom 


A 

10 -8 cm 

lO" 10 m 

1 barn 


b 

10 -24 cm 2 

10 _28 m 2 

1 bar 


bar or b 


10 5 N m -2 

Boltzmann constant 

k 


1.380 x 10“ 23 J K -1 

1 calorie 


cal 


4.1868 J 

1 dyne 


dyn 

0.224 809 x 10^ 5 lbf 

10- 5 N 

1 day 



86 400 s 

86.4 ks 

1 degree (plane angle) 


0.017 45 rad 

17 mrad 

Electron rest mass 

m e 

9.10956 x 10- 28 g 

9.109 56 x 10 _31 kg 

1 erg (dyn cm) 



6.242 x 10 11 eV 

2.39 x 10~ 8 cal 

io- 7 J 

1 erg/cm 2 



6.242 x 10 u eV cm -2 

10 _3 J m -2 

Gas constant 


R 

8.3 143 x 10 7 erg K^mol -1 
1.987 cal K -1 mol -1 

8.314 3 J K-^ol” 1 


A1 


2.71 g cnr 3 

2710 kg cm -3 

Density < 

Fe 

Cu 

P 

7.87 g cnr 3 

8.93 g crrr 3 

7870 kg cm -3 

8930 kg cm -3 


Ni 


8.90 g crrr 3 

8900 kg cm -3 

Electronic charge 

e 

1.602 x 10~ 20 emu 

0.1602 aC 

1 electron volt 


eV 

3.83 x 10- 20 cal 

1.602 x 10~ 12 erg 

0.1602 aJ 

Faraday 


F = N\e 


9.6487 x 





10 4 C mol -1 

1 inch 


in 

2.54 cm 

25.4 mm 

1 kilogram 


kg 

2.204 62 lb 

1 kg 

1 kilogram-force/cm 2 

kgf/cm 2 

14.22 lbf/in 2 


1 litre 


1 

0.220 gal 

1 dm 3 

1 micron 


/im 

10 4 Angstrom 

10 -4 cm 

10“ 6 m 
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Quantity 


Symbol 

Traditional units 

SI units 

1 minute (angle) 

/ 


2.908 x 10 -4 radian 

2.908 x 10~ 4 rad 


A1 



70 GN nr 2 

Modulus of 

Fe 



210 GN m -2 

elasticity < 

Ni 

E 


209 GN m -2 

(average) 

Cu 



127 GN m -2 


Au 



79 GN m -2 

Planck’s constant 


h 

6.6262 x 10 -27 erg s 

6.6262 x 10“ 34 J s 


f A1 



0.34 


Au 



0.44 


Cu 



0.35 

Poisson < 

Mg 

V 


0.29 

ratio 

Pb 



0.44 


Ti 



0.36 


Zn 



0.25 

1 pound 


lb 

453.59 g 

0.453 kg 

1 pound (force) 


lbf 


4.448 22 N 

1 psi 


lbf/in 2 

7.03 x 10~ 2 kgf/cm 2 

6 894.76 N m~ 2 

1 radian 


rad 

57.296 degrees 

1 rad 


A1 


2.7 x 10 n dyn cm -2 

27 GN rcr 2 


Fe 


8.3 x 10 n dyn cm -2 

83 GN rcr 2 

Shear modulus < 

Ni 

t 1 

7.4 x 10 n dyn cm -2 

74 GN m~ 2 

(average) 

Cu 


4.5 x 10 n dyn cm -2 

45 GN m -2 


Au 


3.0 x 10 n dyn cm -2 

30 GN m -2 

1 ton (force) 


1 tonf 


9.964 02 kN 

1 tsi 


1 tonf/in 2 

1.574 9 kgf/mm 2 

15.444 3 MN nr 2 

1 tonne 


t 

1000 kg 

10 3 kg 

1 torr 


torr 

1 mm Hg 

133.322 N m -2 

Velocity of light (in vacuo) 

c 

2.997 925 x 10 10 cm/s 

2.997 925 x 10 8 m s" 1 
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Acceptor level, (band theory), 184 
Acetabular cup (hip joint), 394, 398 
Acheson carbothermic process, 334, 341 
Acrylonitrile-butadiene-styrene (ABS), 354, 355, 356, 357 
Activation energy (Q), see Arrhenius equation 
Additives: 

anti-oxidant, 353 
anti-ozonant, 353 
filler, 352 
plasticizer, 352 
stabilizer, 352 
Adhesives, 414 

Age-hardening, 260, 261-2, see also Hardening 
(precipitation-) 

Allotropy, 20, 60, 337 

Alloying elements in steel, 278, 283, 290, 298-9 

Alnico alloys, 191 

Alumina: 

crystal structure, 26 
deformation, 103, 116 
fibre reinforcement, 369, 371 
laser ruby, 196 
nanocomposites, 374 
refractories, 323 
specific moduli, 321 
strength-probability-time diagram, 349 
zirconia-toughened, ZT(A), 331 
Aluminium alloys, 316-17, 412, 414 
Al-Ag, 263-4 
Al-Li alloys, 317 
designation (IADS), 316 
mechanical alloying, 318-9 
rapid solidification processing, 318 
superplastic, 317-8 
Anelastic behaviour, 176 
Anisotropy: 

anisotropic thermal expansion, 121 
baked carbons, 340 
birefringence in crystals, 129 
composites, 362, 369 
development during solidification, 45 
elastic, 203 

optical properties of crystals, 129 


stress-anisotropy of plastics, 129 
wood, 407 

Annealing, 53, 86, 98, 117-9, 237-45 283, 378, see also 
Grain growth; Preferred orientation; Recovery stage of 
annealing; Recrystallization; Twinning 
Annealing point (glass), 345 
Anti-phase boundaries (APB), 178, 214, 232, 313 
Archery bows, arrows, strings 411-3 
Arrhenius equation, 80, 89, 175, 177, 178, 218, 237, 246, 
282, 345 
Asbestos, 28-9 

Ashby diagrams, 168, 286, 356 

Asterism, 238, 267 

Atom: 

atomic mass, relative, 7 
atomic number (Z), 4, 133 
atomic volume, 168 
Bohr model, 2 
energy states, 9-10 
interatomic distance, 21 
nuclear cross-section, 88 
transmutation, 88 
see also Electron 

Auger electron spectroscopy (AES), 46, 150, 154 

Ausforming, 283 

Austenite, 60-1, 76, 274 

Austempered ductile iron (ADI) 304-5 

Austempering, 283, 304 

Avogadro constant (N), 21 

Avrami equation, 240, 276 

Bain transformation theory, 279 
Bainite, 274, 282 

Band structure (electronic), 181-3 
conduction band, 183, 193, 194 
diamond, 339-40 
magnetism, 189 
optical behaviour, 195 
valency band, 183, 184, 193, 194 
Bardeen, Cooper and Schrieffer (BCS) theory of 
superconductivity, 186 
Barkhausen effect, 190 
Basquin’s law 253 
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B-H curves for magnets, 189, 

Bauschinger effect, 235 
Bauxite, 324-5 
Beam theory (bending), 413 
Bearings, 61 
Bend test, 201, 412 
Berman-Simon line, 338 
‘Beryllium-copper’, 53, 260, 411 
Bicycle frames, wheels, 413-5 
Biomaterials: 
bioactive, 394, 395, 399 
biocompatible, 394, 399, 401 
biodegradable, 395, 405 
bioinert, 394 
biometric, 404 
mechanical properties, 396 
requirements, general, 394-5 
Bismaleimides (BMI), 367 
Bloch wall, 190 
Boart, 337 

Boltzmann constant, 51, 80, 85 
Bonding, interatomic, 7-10 
8-N Rule, 9 

bonding and energy levels, 9 
covalent, 9 
ionic, 8 
metallic, 8 
van der Waals, 9 
Bone, human: 

fracture repair, 397 
osteoblasts, osteocytes, osteons, 398 
see also Joints 
Boric oxide, 30 
Boron nitride, 23 

Bragg diffraction law, 134, 135, 141 
Brasses: 

brazing alloys, 414 
compounds, 24, 78 
phase diagram, 60 
season cracking, 386 
Bridgman method (single crystals), 47 
Bright- annealing of copper, 378 
Brillouin zone, 75, 77, 181-2 
Buckminsterfullerne, 344 
Bulk density, 321 
Bulk modulus, 197 

Burgers vector, 91-2, 102, see also Thompson 
tetrahedron 

Capacitors, 193-4 
Carbon, 337-45, 353 
Carbon electrodes, baked, 340 
Carbon, vitreous, 343 
Cast ingot structure, 44-5 
Cast irons, 303-5 

austempered ductile (ADI), 304 
grey, 304 
malleable, 304 
mottled, 303 

spheroidal graphite (SG), 304 
white, 303 

Cathodic (sacrificial) protection, 386, 415 
Cement and concrete: 
reinforcement, 372 
thermal analysis, 165 
Cementite, 60, 274 


Ceramics: 

classification, 320 
elastic deformation 203 
firing, 322, 325, 326 
general properties, 321-2 
optical applications 195 
production of powders, 322-3 
testing, 200-1 

thermal shock resistance, 334 
time-dependence of strength, 348-9 
transformation-toughening, 330 
Channelling patterns, 145-6 
Chemical stress, 89 
Chemisorption, 378 
Coble creep 175-6, 249 
Cofhn-Manson law 253 
Coherency (interfaces): 
devitrification, 332 
inoculation, 45 
laser-heated surfaces, 392-3 
nucleation in solids, 82-3 
precipitation-hardening, 262 
recrystallization, 245 
strain, 191 

Cold-drawing (plastics), 355 
Collagen, 397, 400, 402 
Colour, 195 

Compliance, elastic, 202, 413 
Composites, fibre-reinforced: 
principles: 

functions of matrix, 361 
modulus ratio versus stress ratio, 362 
Rule of Mixtures, 362 
see also Fibres 
types: 

carbon fibre (CFRP), 367, 369, 408, 410, 412 
ceramic-matrix (CMC), 372-3 
continuous-fibre, 361-3 
duplex steel 416 

glass-reinforced (GRP), 361, 366 
metal-matrix (MMC), 368-72 
polymer-matrix (PMC), 366-7 
short-fibre, 363 
Compounds: 

AB-type, 24 
AB 2 -type, 24 

AB 204 -type (spinel), 26-7 
ABOs-type, 26 

deviation from stoichiometric composition, 87 

electrochemical, 76-7 

electron, 78-9 

intercalation, 343-4 

intermetallic, 312-5 

Laves phases, 77 

order-disorder transitions, 79-80 

size-factor, 77 

Conservative motion, of dislocations, 94 
Considered construction, 199 
Conversion factors and constants, 422-3 
Coordination: 

coordination number (CN), 20 
ionic crystals, 22 
Pauling Rules 22-4 
radius ratio (r/R), 22 
Cordierite, 72 
Cords in glass, 117 
Coring, 53-4 
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Corrosion, aqueous: 

differential aeration cell, 382 
electrochemical principles, 382-4 
failures, 386-7 
inhibitors, 384 
passivity, 384, 385-6 
prevention, 384-6 
Corrosion-fatigue, 254, 387 
Crazing of plastics, 355-6 
Creep, metallic 199-200, 245-51 
fracture, 249 

grain boundary diffusion (Coble), 175-6, 249 
grain boundary sliding, 247, 258 
Herring-Nabarro, 176, 249 
tertiary, 249 
testing, 199-200 
transient and steady-state, 245-6 
Creep-resistant alloys, 249-51 
Critical held and temperature (superconductivity), 
185-7 

Cross-slip, 93, 101-2, 210-1, 214, 230, 232, 268 
Crystal structures: 
alumina, 26 

barium titanium oxide, 26 
boron nitride, 23 
cristobalite, 25 
CuAu, 79, 179 
Cu 3 Au, 79 
CuA1 2 , O', 270 
CuZn, 24, 79 
diamond, 21, 103 
Fe 3 Al, 79 

graphite, 21, 341-4 
Kaolinite, 29 
Ll 2 structure, 115 
magnesia, 24 
martensite, 278 

metals at room temperature, 20 
MgCu 2 , 77 
perovskite, 26, 187 
Potassium graphite 343-4 
polyethylene, 38 
silicon nitride, 326 
TiAl (Ll 0 ), 314-5 
zinc sulphide (blende), 23 
YBa 2 Cu 3 07-x, 187 
Crystallinity in polymers, 36-7, 38 
chain-folding model, 39 
crystalline melting point (r m ), 38, 355 
defects, 116-7 
microscopy, 129-30 
spherulites 40, 117, 355 
Crystallography: 

axial ratio (c/a), 15, 19, 204, 278 
crystal systems, 12 
directions and planes, 14-16 
equivalence, 16 
interplanar spacing, 134 
lattices, 12-13 
Miller-Bravais indices, 15 
Miller indices, 14 
reciprocal lattice, 141-2 
stereographic projection, 16-19 
symmetry, 17-19 
unit cell, 12 

Vector Addition Law, 17 
Weiss Zone Law, 16 


Curie temperature (point), 27, 190 

Curing of thermosetting resins, 37, 357, 415 

Czochralski ‘crystal-pulling’ (single crystals), 47 

Dacron 400-3, 413 

Damping capacity, 176, 407-8 

de Broglie relation, 10, 125 

Debye characteristic temperature, 170-1 

Debye-Scherrer (power) method of X-ray analysis, 136 

Deep-drawing, 234 

Defects: 

ceramics, 323 
defect lattice, 80 
defect tetrahedra, 108 
glasses, 117 
line, 90-7 
planar, 97-103 
point, 84-90 
polymers, 116-17, 360 
volume, 104 

see also Vacancies; Dislocations; Radiation damage; 

Stacking fault; Voids 
Deformation, elastic, 201-3 

Deformation mechanisms (Ashby) maps, 176, 251-2, 356 

Dendrites, 43-4 

Density: 

dislocations, 90 
intermetallic compounds, 312 
theoretical, calculation of, 21, 168 
various materials, 321 
see also Property ratios 
Dental materials, 395-7, 
bioglass, 397 
implants, 397 
porcelain, 397 

Detonation-gun (D-gun) method of coating, 391 
Devitrification, 31, 117, 331-2 345 
Dezincihcation, 386 
Diamond: 

classification, 339-40 
natural, 337-8 

polycrystalline (PCD), 340, 371 
structure, 8, 21 
synthetic, 338 

Dielectric materials, 70, 72 193-4, 324 
Differential scanning calorimetry (DSC), 165-6 
Differential thermal analysis (DTA), 165 
Diffusion: 

Cu/Zn couple, 60 
diffusion coefficient (D), 173 
Fick’s 1st law, 173, 175 
Fick’s 2nd law, 54, 173 
interstitial, 174 
mechanisms, 173-5 
oxidation, during, 379 
pipe, 250 

radiation-induced, 124 
reptation in polymers, 41 
self- 86, 175 
stress-induced, 174-5 
surface, 175-6 
‘uphill’, 273 
vacancy, 174 
Diffusion bonding, 370 
Dilatation strain, 202, 215 
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Dilatometry, 169 

Dipoles in dielectric materials, 193, 194 
Dislocation: 
behaviour of: 

‘atmosphere’ (Cottrell) locking, 212, 214-6 
chemical stress, response to, 90 
climb, 93, 94, 110-2, 237, 246-7, 288 
cross-slip, 93, 101-2, 210, 214, 230, 232, 268 
decoration of, 155 
dissociation, 99-100, 105, 108, 116 
Frank-Read source, 210, 223, 265 
interaction, 96 

Lomer-Cottrell barrier, 103, 107 
motion, conservative and non-conservative, 94 
multiplication, 210 
pile-up, 247, 288 
characteristics: 

Burgers vector, 91-2 
density, 230, 237, 371 
force acting per unit length, 92 
image formation in TEM, 147, 155-6 
jog, 93-4, 97, 108 
kink, 216 
line tension, 210 
loops, see Dislocation loops 
strain energy, elastic, 95-6 
velocity, 207-8 
width, 207 
forms: 

dipole, 94, 228 
edge, 91 

extended, see Stacking faults 
extrinsic, 98 
‘forest’, 93, 238 
grain boundary (gbds), 97-8 
intrinsic, 98 
ionic crystals, fee, 97 
misfit, at interface, 83 
partial (Shockley), 99, 105 
screw, 91, 97 
sessile (Frank), 102 
stair-rod, 102, 107-108 
superdislocations, 114, 214, 231 
unit, 106 
loops: 

cross-slip multiplication source, 210-1 
double, 107, 111, 119 
growth and stability, 117, 269 
irradiation effects, 119-20 
Orowan, 231, 266, 371 
prismatic, 94, 106, 119 
sessile (Frank), 102, 104, 107, 121 
single, 91, 93 

Domain structures, 178, 190 
Dough-moulding compounds (DMC), 366 
Drug delivery, polymeric, 405 
‘Ductile’ ceramics 286-7 
Dulong and Petit law, 170, 179 
Duplex (double) ageing, 271 
Duplex steel, 416 
Duralumin alloys, 260 

Elastomers, 35-6 
Electrical conductivity, 181-3 
ceramics, 321 
effect of ordering, 180 


Electrical contacts, 62 

Electrochemical effect, in alloying, 73, 74 

Electrochemical Series, 383 

Electrode (half-cell) potential, 382 

Electrographites, 340 

Electromagnetic spectrum, 125, 194 

Electron: 

-atom ratio (e/a), 75, 78, 186 
Auger, 142, 143, 150, 154 
band structure theory, 181-3 
back-scattered (BS), 144, 150 
exchange energy (magnetism), 190 
‘gas’, 8 

orbitals, radii of, 190 
polarization in dielectrics, 193 
quantum numbers, 2-4 
secondary, 150 

states, density of, 9-10, 75-6, 182 
states, filling of, 3-7 
volt, 10 

see also Atom: Electron microscopy 
Electronegativity, 74,77 

Electron energy loss spectroscopy (EELS), 150, 152-4 
Electron microscopy, 142-54 
back-scattered (BS) electrons, 144 
bend (extinction) contours, 156 
bright-and dark-held imaging, 147, 161 
convergent beam diffraction pattern (CBDP), 149 
diffraction contrast, 147 
dynamical theory, 158-60 
electron channelling, 145-6 
hrst-order Laue zones (FOLZ), 150 
g- vector, 156-8 

higher-order Laue zones (HOLZ), 149 

higher- voltage electron microscopy (HVEM), 149-50 

imaging of dislocations, 147, 157-8, 159, 166 

Kikuchi lines, 148 

kinematical theory, 156 

scanning electron microscope (SEM), 144-6 

scattering of plasmons and electrons, 142-3 

selected area diffraction (SAD), 147 

transmission electron microscope (TEM), 143-4 

weak-beam, 160-1 

Electron probe microanalysis (EPMA), 150-2 
Electro-optic ceramics, 196 
Emery, 324 

Epitaxy, see Coherency 
Epoxy resins (adhesives), 36, 412, 415 
Equilibrium diagrams, see Phage diagrams 
Etching techniques for microscopy, 126, 154 
Eutectic reaction: 
binary systems, 56 
in-situ composites, 374 
ternary systems, 66-9 
Eutectoid reaction, 58, 274 
Ewald ‘reflection’ sphere, 141-2, 148, 156 
Extended X-ray absorption fine- structure spectroscopy 
(EXAFS), 163 
Extrusion of plastics, 356-7 

Fatigue, 200, 252-8 
Basquin’s Law, 253 
Cofhn-Manson Law, 253 
cracking and failure, 256-8, 295-6, 413, 416 
corrosion-, 254, 387 
endurance limit, 200 
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fatigue limit, 252, 254 
fatigue ratio, 253 
hardening, 256 
high-temperature, 258 
Miner’s hypothesis, 254 
S-N diagram, 200, 252 
surface striations, 257-8 
testing, 200, 252-3 
variables, 252-4 
Feldspars, 30 
Fencing foils, 415-6 

Fermi surface (level), 10, 75, 183, 186, 189 
Ferri-and ferroelectric materials, 194 
Ferrite, 60, 76, 274, 301 
Fibres for composites: 
aramid, 367 
aspect ratio, 363 
boron, 369 
carbon, 367 
coupling agents, 364 
critical length, 363 
glass, 285, 361, 366 
lay-up sequence, 365 
orientation, 364-6 
stress transfer length, 363 
whiskers, 284, 369 
Firebricks, 63 

Floating-zone technique (single crystals), 47 
Flow stress 203, 206, 219, 233, 235, 268 
Fluxes: 

alumina, 325 
dielectrics, 72 
glasses, 30, 31 
silicon nitride, 326 
solders, 56 

Foams (polymeric), 169, 287 418 
Forsterite ceramics, 71 
Fracture: 

brittleness in ceramics, 286-8 
cleavage, 199, 288-9 
creep, 249, 293-4 
cup-and-cone, 199 

debonding in composites, 364, 371, 372 
ductile-brittle transition, 198, 289 
ductile fracture, 104, 292-3 
fatigue, 256-8, 295-6, 348 
fracture mechanism map, 294-5 
hydrogen, effect of, 291 
intercrystalline, 199 
intergranular, 287 

slow (delayed) crak growth in ceramics, 348 
toughness, 285-7 

toughness parameter (K c ), 199, 286 
transformation-inhibited, 331 
twins, effect of, 224 

Frank Read Source, dislocations, 210, 223, 265 

Fretting corrosion, 387 

Fullerenes and fullerite, 344 

‘Fused quartz,’ 345 

Fusible alloys, 68, 408 

Galvanic Series, 383, see also Electrochemical Series 
Gauss error function, 174 
GEC process for synthesizing diamonds, 338 
Gibbsite, 325 


Glass: 

manufacture: 

commercial glass, of 31 
formation from melt, 31 
working range of temperature, 345-6 
see also Fluxes 
properties: 
colour, 195 
optical, 129 
specific moduli, 321 
time-dependency of strength 348 
viscosity, 333, 345 
structure: 

devitrification, 31, 117, 331-2, 345 
network-formers, 31 
network-modifiers, 31, 117 
network structure, 11, 31 
types: 

AR-, E-, H-modulus, S-, 366 
aluminosilicate, 346 
borosilicate ( Pyrex ), 346 
chemically-strengthened, 346-7 
‘fused quartz’ 345 
laminated, 347 
‘lead crystal’, 346 
photochromic, 195 
soda-lime, 346 
thermally-tempered, 346-7 
Vycor, 48 

Glass-ceramics, 322, 331-4 

Glass transition point ( T g ) 35, 351-2, 355, 367 

Glaze, 324 

Globars (SiC), 336 

Golf club shaft, and heads 410-1 

Goss texture, 146 

Grain boundary: 

cavitation during fatigue, 258 

coincident site lattice (CSL) model, 98 

diffusion creep (Coble), 176, 249 

dislocation pile-up at, 155 

fracture effect on, 290 

grain boundary sliding, 247-8, 258, 288 

high-angle, 43, 97, 239 

intergranular nucleation, 240 

low-angle (tilt), 43, 97 

tripe junction, 99 

twist, 97 

work-hardening at, 231-2 
Grain growth, 242-3 
Grain size: 

dual-phase steels, 301 
heat-treatment (steel), effect on, 276-7 
toughness, effect on, 289-90 
Hall-Petch equation, 216, 219-20, 224, 289 
measurement, 131-2 
yield strees, effect on, 219-20 
Grain structures: 

cell formation, 230, 238 
chill crystals, 44 
columnar crystals, 45, 46 
equiaxed grains, 43, 45 
planes of weakness, 45 
Graphite: 

conventional structure, 22 
intercalation compounds, 344 
pyrolytic (PG), 341-3, 368, 402 
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Graphite: ( cont .) 
turbostratic, 367 

‘Green’ powder compact, 322, 325 
Greninger-Troiano theory, 279 
Griffith micro-crack criterion, 284, 289 
Griffith-Preston (GP) zones, 261, 267-8 
g-vector in electron microscopy, 157-8 

Hall-Petch equation, 216, 219, 224, 289 
HAPEX, composite, 399, 402 
Hardening: 
chemical, 266 
coherency strain-, 265-6 
dispersion-, 231, 250, 266-7, 301, 302 
fatigue-, 255-6 

heat-treatment of steels, 274-84 
irradiation, 290 
point defect, 224-6 
precipitation-, 53, 259-74, 317 
secondary, 283 
texture-, 233-5 
work (strain)-, 226-32 
Hardness (indentation): 

Brinell, 199 
hot, 329 
Knoop, 130 
Meyer line, 131 
microhardness, 130-1 
Vickers, 130, 199 

Heat-affected zone (HAZ), 405, 414 
Heisenberg’s Uncertainty Principle, 2 
Helmets Safety, 417-8 
Herring-Nabarro Creep, 176, 249 
Heusler alloy, 188, 192 
Hooke’s Law, 197, 202 
Hume-Rothery Rules, 73 
Hund’s Rule, 4, 7 
Hydrogen embrittlement, 291 
Hydroxyapatite (HA), 396, 397, 399, 402 

Impact testing, 199, 418 

Internal friction, 176-7 

Investment casting (‘Lost wax’ process), 410 

Ionic crystals: 

coordination in, 22 
dislocations in, 97 
In- situ composites, 373-4 
Inclusion counting methods, 131-2 
Inoculation of melts, 45 
Insulators, electrical, 193-4 
Intermediate phases, 59, see also Commpounds 
Intermediate compounds, 312-5, 370 
Interstitial atoms, 86-7, 122 
‘Invisibility’ criterion (g) in electron microscopy, 
157-8 

Ion implantation, 391-2 
Iron, 172 

Iron-nickel alloys ( Permalloys ), 191 
Iron- silicon alloys (magnets), 191, 243 
Isoforming, 283, 284 
Isostatic pressing: 
alumina, 325 
silicon nitride, 325-6 
Isothermal annealing, 283 


Johnson-Mehl equation, 240 
Joining: 

adhesives, 414 
brazing, 414 
HAZ problems, 414 
Tungsten-inert gas (TIG) welding, 414 
Joints, human: 
failure, 399 
finger, 399 
hip, 394, 398 
knee, 399 
shoulder, 399 
wrist (carpal), 400 

Kaolinite, 29, 63 
Ke torsion pendulum, 177 
Kear-Wilsdorf (K-W) lock, 232 
Kevlar, 367, 408, 410 
Kikuchi lines, 148, 157 
Kirkendall effect, 123, 175 
inverse, 123 

Kurdjumov-Sachs orientation relation, 278 

Larsen-Miller parameter, 308 
Lasers, 195-6, 412 
Lattice friction, 207, 225 
Lattice rotation, 205-6 
Laue method of X-ray analysis, 135 
Laves phases, 26, 77 
Le Chatelier principle, 49 
‘Lead crystal’ glass, 346 
Lever Rule, 52 

Liquid pressure forming (LPF), 370 
Liquidus, 52 

Littleton softening point (glass), 345 
Lomer-Cottrell barrier 103, 107 
Liiders band, 211-2 

M s temperature, 276, 281, 283 
Magnesia (MgO), 8, 115 
Magnet: 

hard (permanent), 188-9, 191 
soft (temporary), 188-9, 191 
Magnetic alloys, 191-2 
Magnetic hysteresis (B-H loop), 189, 191 
Magnetic remanence, 189 
Magnetic susceptibility, 188-9, 192 
Magnetism: 

anti-ferro-, 162, 192 
dia-, 189 
ferri-, 193 

ferro-, 162, 172, 188, 189-90 
para-, 162, 188 
Magnetostriction, 191 
Magnox alloys, 258 
Manganese sulphide in steel, 60 
Martempering, 283 
Martensite, 61, 76, 274-5, 278-83 
Material property (Ashby) chart, 168, 286 
Maxwell-Boltzmann distribution law, 80 
Mechanical alloying, 301-2, 318-9, 369 
Meissner effect, 186 
Melt flow index (MFI) test, 358 
Melting point: 

congruent, 52, 58, 66 
creep-resistant alloys, 249 



Index 433 


electrochemical effect in alloying, 74 
incongruent, 58 
intermetallic compounds, 312 
intermediate phase, of, 76-7 
pure metal, 42 

Michel-Levy colour chart, 129 
Microscopy, light: 
etching, 126 
hot-stage, 129-30 

illumination, bright- and dark-held, 126 
magnification, 127 
microhardness testing, 130-1 
numerical aperture (NA), 127 
objective lens, 126 
Burch reflecting, 130 
oil-immersion, 127 
ocular lens, 126, 127 
parfocalization, 127 
phase-contrast, 127-8 
polarized light, 129 
quantitative, 131-2 
grain size, 131 
inclusions, 131 
stereological notation, 131 
resolution, 126 
tube length, 127 
Miller-Bravais indices, 15-16 
Miller indices, 14 
Mineralizers, 25 

Miscibility, solid see Solid solution 
Modifiers, see Glass 

Modulus of rupture (MoR), 201, 321, 411 
Moduli, elastic, 197, 203, 321, 396 
Molecular mass distribution (MMD): 
in polymers, 33, 360 
polydispersivity index, 33 
Molecular sieves, 30 
Molybdenum disulphide, 22 
Moment of Inertia (I), 410, 413 
Monel , 52 

Monotectic reaction, 59, 61, 64 
Moulding of plastics: 
blow-, 357, 359 
injection-, 356-7, 360 
reaction injection- (RIM), 357-8 
tennis rackets, 408 
Mullite, 63, 335 

Multiple cross-slip (Koehler), 210 
Mylar, 413 

n-type semiconductor, 87, 184, 380, 381 

Nanocomposites, 374 

Neel temperature, 192 

‘Necking’ during tensile test, 198 

Nernst equation, 383 

Neumann bands, 223 

Neutron: 

diffraction, 161-2, 179 
mass absorption coefficients, 162 
scattering amplitudes, 162 
‘thermal’, 161 
Newtonian how, 345, 358 
Nichrome alloy, 382 
Nickel aluminides, 312-4 
Nimonic alloys, 249, 295 
Nishiyama orientation relation, 279 


Nitridation, 325-6 

Non-destructive evaluation (NDE) of ceramics, 323, 337 
Notch-sensitivity, see Fracture; Impact testing 
Nuclear hssion, 88, 122, 161 
Nucleation and growth processes: 
defect tetrahedra, 108 
glass-ceramics, 332 
heterogeneous nucleation, 43, 82 
homogeneous nucleation, 42, 81-2, 332 
nucleating agents (inoculants), 45, 333 
nucleation in solids, 82-3 
oxidation, 378 
pearlite formation, 276-8 
precipitation-hardening, 53, 259-60, 270-1 
recrystallization, 239-42 
spherulite formation in polymers, 40 
twinned crystals, 222-3 
void formation, 104, 292, 371 
Nylon, 355, 359, 366, 409 


Ophthalmic materials (lenses), 404 
Optic axes of crystals, 129 
Optical hbres, 195 
Optical properties, 195-6, 339 
Ordering of atoms: 

effect on physical properties, 179-80 
entropy of disorder, 49-51 
irradiation effects, 123-4 
magnets, in, 191-2 

order-disorder transformation, 60, 79-80 
ordered solid solutions (superlattice), 79, 113-4, 178, 
192, 231 

short-range, long-range 11-12, 177-8 
work-hardening, effect of, 231-2 
Orowan loops, 231 
Ostwald ripening, 272 
Overvoltage, 383 
Oxidation of metals: 
kinetics, 378-80 
logarithmic rate law, 379-80 
parabolic rate law, 379 
thermodynamics, 376-8 


Pacemaker, heart, 403 
p-type semiconductors, 87, 184, 380, 381 
p-n-p semiconductor, 184-5 
Paris-Erdogan (fatigue) equation, 295-6 
Particles, precipitate: 
coarsening of, 272-3 
stability of, 74 

Pauli Exclusion Principle, 2, 9, 190 
Pauling Rules, of coordination, 22-4 
Pearlite, 60, 276-8 

Peierls-Nabarro stress, 207, 213, 280, 312 
Periodic Table, 4-7, 74, 151 
Peritectic reaction: 

in binary systems, 57-8, 60 
in ternary systems, 69 
Peritectoid reaction, 58 
Permalloys, 191 
Permittivity, 193 

Persistent slip band (PSB), 255-7 
Phase, 48 
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Phase diagrams: 
principles: 

arrest points, 60 
binary, 52 

double-reciprocal, 327 
Gibbs triangle, 65 
Lever Rule, 52, 56 
limitations, 59-60 
liquidus, 52 
miscibility gap, 123 
Phase Rule, 64 
solidus, 52 
solvus, 52 
ternary, 65-72 
tie-line, 52, 64, 66, 70 
tie-triangle, 65 
unary, 48 
systems; 

Al-Cu, 78, 259-60 
A1 2 0 3 - Si0 2 , 63 
Au-Ni, 73 
Au-Pt, 73 
Cr-S-O, 382 
Cu-Ag, 73 
Cu-Be, 53 
Cu-Pb, 61-2 
Cu-Sn, 78 
Cu-Zn, 60, 78 
Fe-C, 60-1, 274 
Fe-Cr, 298 
Fe-Ni, 298 

MgO - A1 2 0 3 - Si0 2 , 70-2 
Mg-Si, 59 
Ni-Cu, 52, 73 
Ni-Pt, 73 
Ni-S-O, 64-382 
Pb-Sn, 57 
Si-Al-O-N, 327 
Ti-Al, 309 
Ti-Cu, 309 
Ti-V, 309 
Zr0 2 - Y 2 0 3 , 330 
Phase equilibrium: 

Class I reactions, 68 
Class II reactions, 69 
Class III reactions, 69 
four-phase, 68-9 
two-phase, 52-6 
three-phase, 56-8, 61-8 
Phase Rule, 48-9, 64 
Phase transformations: 
austenite-bainite, 282 
austenite-martensite, 278-82 
ceramics, 63 
conversions, 24-5 
devitrification, 31 
diffusionless, 275 
inversions, 24 

order-disorder transformation, 60 
pure metal, 48 

solidification of pure metal, 42 
specific heat, changes in, 171 
thermodynamical aspects, 50 
volumetric changes, 169 
zirconia, 330-31 

Phenol-formaldehyde resin (P-F; Bakelite ), 36, 37, 357 


Piezoelectric effect, 194 
Pilling-Bedworth (P-B) ratio, 379 
Pistons, diesel, 371 
Plasma-spray method of coating, 391 
Plasmon interactions, 142, 153-4 
Platinum phthalocyanine, 117 
Plasticity, macroscopic, 235-7 
Poisson ratio, 197, 202 
Polarization, 193, 194 
Polarized light microscopy, 129 
spherulites in polymers, 40 
Polyacrylonitrile (PAN), 366, 408 
Polybutadiene, 36 
Polybutylene (PB), 353 
Polycarbonates, 354 
Polychloroprene, 36, 354 
Polyester resin, 37, 366 
Polyether ketone (PEEK), 366, 409 
Polyethylene (PE), 33, 39, 41 
linear low-density (LLDPE), 353 
low-density (LDPE) and high-density (HDPE), 34, 38, 
353 

ultra-high molecular weight (UHMPE), 353 
Polygonization, 238 
Polyimides (PI), 367 
Polyisoprene (natural rubber), 36, 354 
Polymerization: 

condensation polymerization, 37 
copolymerization, 35 
degree of {n), 32-3 
vulcanization, 35 

Polymethyl methacrylate (PMMA), 38, 352, 357 
Polypropylene (PP), 34, 35, 39, 130, 353, 354, 355, 359, 
366 

Polystyrene (PS) 34, 38, 40, 352, 354, 355, 356 
Polytetrafluoroethylene (PTFE),352, 366 
Polytypoids (polytypes), 328, 334 
Polyvinyl acetate (PVAc), 34 
Polyvinyl butyrate (PVB), 347 
Polyvinyl chloride (PVC): 
plasticized, 34, 352, 353, 357 
unplasticized (UPVC), 353 
Porosity, gas: 

scavenging treatment of melts, 45-6 
Sievert’s relation, 45 
Pourbaix diagram, 384 

Powder (Debye-Scherrer) method of X-ray analysis, 136, 
142 

Powders, ceramic 322-3, 324-5, 334 
Preferred orientation: 
annealing texture, 233 
cube texture, 191, 245 
deformation textures, 232-3 
Goss texture, 146, 243 
recrystallization texture, 245 
texture transition, 233 
Preferred (Renard) numbers, 127 
Prepregs (pre-impregnated shapes), 366 
Proof-testing, 201, 348 
Property ratios, specific: 

E/p, 321, 410, 411, 412, 413, 416 
MoR Ip, 411 
Ki c /oy, 286, 287 
Gy/p, 169, 369, 407, 413 
Prosthetic materials: 
ear implants, 402 
heart valves and arteries, 402-3 
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maxillofacial surgery, 401 
plastic surgery, 400 
tissue repair, 403 
Pykecrete, 361 
Pyroelectric materials, 194 

Quantum theory, 2-4, 170 

Radiant tubes (gas-fired furnaces), 336 
Radiation damage: 

damage cascades, 121-3 
displacement spike, 88 
electron miroscopy, 149 
growth and swelling, 121-3 
induced precipitation, 123 
induced segregation, 123 
microscopy, 149 

neutron ‘annealing’ of metals, 90 
ordered alloys, 123-4 
radiation hardening, 225 
thermal spike, 88 
void formation, 104 
Radiography, 133 

Reaction-sintering (bonding), 325, 335 
Read-Shockley formula, 97 
Reciprocal lattice, 141, 148, 156 
Recovery stage of annealing, 86, 237-8, 246 
Recrystallization, 146, 239-42, 243, 293, 319, 342 
REFEL silicon carbide, 334 
Refractories: 
properties: 

general, 25, 29, 62-4 
modulus of rupture, 201 
pyrometric cone equivalent (PCE), 62 
refractoriness, 63, 322 
spalling, 322 
types: 

alumina, 63, 323-4 
aluminosilicate 62-3 
carbon, 337 
insulating, 322 
mullite, 63 
silica, 63 

silicon carbide, 334-5 
zirconia, 330-1 

Relative valency effect, in alloying, 73, 74 

Relaxation modulus (polymers), 351 

Relaxation time (anelastic), 176, 359 

Reptation in polymers, 41 

Resins, thermosetting, 36-8, 357-8, 415 

Resolution, 126 

Reversion (retrogression), 260 

Rubber: 

decomposition, 164 
hard {Ebonite), 35 
natural, 354 
repeat units, 36 
silicone, 36, 371 
styrene-butadiene-, 36, 354 

SI units, 420-1 

S — N diagram (fatigue), 200 

Schmid’s Law, 233 

Season-cracking of brass, 386 

Secondary ion mass spectrometry (SIMS) 163-4 

‘Seeds’ in glass, 117 


Segregation: 

cellular microsegregation, 54-5 
inverse, 46 
normal, 46 

X-ray analysis of, 141 
Selected area diffraction (SAD), 148 
Semiconductors, 87, 103, 138, 183-5, 195, 336, 340, 392 
Sensitization of austenitic steels, 386 
Shape-memory effect (SME), 315 
‘Sharkskin’ (plastics), 360 
Shear strength of crystals, theoretical, 90-1 
Shear stress for slip, resolved 203-4 
Sheet moulding compounds (SMC), 357, 366 
Shrinkage: 
firing, 325 
plastics, 361 
silicon nitride, 326 
solidification, 43 
SI units, 420- 1 
Sialons, 326-9 
Sievert relation, gas, 45 
Silica, 24-5, 30 
Silicates 27-30 
Silicon carbide, 24, 334-7 
Silicon-lithium detector, 152 
Silicon nitride, 326 

hot-isostatically-pressed (HIPSN), 326 
hot-pressed (HPSN), 326 
reaction-bonded (RBSN), 325-6 
Single crystals: 
polyethylene, 39 
production for research, 47 
slip and lattice rotation, 204-5 
stress-strain curves, 226 
turbine blades, 46, 250, 307 
X-ray diffraction analysis, 135 
Sink-marks (plastics), 361 
Sintered aluminium powder (SAP), 242, 250 
Sintering, 119, 243, 322, 325, 330 
Size factor, in alloying, 73, 77 
Skiing boots bindings, skis, poles, 417 
Slip: 

comparison with twinning, 203 
critical resolved shear stress, 205 
dislocation movement, 92-4, 110-1 
lines and bands, 205, 211 
multiple, 205-6 

overshooting of primary system, 206 
persistent slip band (PSB), 255-6 
resolved shear stress, 203-4 
systems in metals, 204 
see also Cross-slip 
Snowboarding equipment, 416 
Sol-gel process, 323 
Solders, 56 
Solid solution: 
ceramic, 326 

complete miscibility in ternary system, 66 
coring in, 53-4 
extended (continuous), 52 
interstitial, 76 
ordered, 79 

Primary (terminal), 52-3, 74-5 
solvus, 53 

substitutional, 73, 79 

see also Hume-Rothery Rules 
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Solidification: 

cellular front, 54-5 
dendritic, 43-4 
directional (DS), 46-7, 374 
kinectics, of, 81-2 
plane-front, 43-4 
pure metal, of, 42 
volume changes, 170 
Solidus, 52 

Solution heat-treatment, 259-60 
Spark plugs, 324 

Specific heat (capacity), 170-1, 179 

Specific modulus, 321 

Spherulites, see Crystallinity in polymers 

Spinels: 

degree of inversion, 27 
ferrospinels (ferrites), 27 
inverse, 27 
oxide scale, 381 
structure, 26-7 

Spinodal decomposition, 273-4 
Sputtering yield (Y), 389-90 
Squeeze-casting, 370 
Stacking fault: 

ceramics, in, 103, 115-6 
corrosion, effect on, 386 
cross-slip, 101-2 
energy, 48, 98, 100 
extrinsic, 102 

imaging in electron microscope, 158, 159 
intrinsic, 102, 105 
ordered structures, 113-4 
structure, 99-102, 112 
tetrahedra, 93, 104, 107-8 
width, 101 
work-hardening, 227 
Stacking sequences: 

cubic structures, 19, 112 
fee crystals, 99, 102 
hexagonal structures, 19, 110-1 
intercalation compounds, 343-4 
precipitation-hardening, 269 
pyrolytic graphite, 341-2 
semiconductors, 103 
Steatite: 

low-loss-factor, 70 
porcelains, 70 
Steels, types of: 
alloy, 278, 298-9, 302-3 
austenitic (stainless), 291, 298, 303 
designation (BS) of steels, 302-3 
dual-phase (DP), 300-1 
duplex, 416 

high-strength low-alloy (HSLA), 286, 299-30 
maraging, 299, 416 
mechanically- alloyed (MA), 301-2 
non-strain-ageing, 212 
plain carbon, 277, 297-8 
Stereographic projection, 16 
preferred orientation, 232-3 
slip, 205 

Thompson tetrahedron, 105 
Stiffness constants, elastic, 202 
Stirling’s approximation, 85 
‘Stones’ in glass, 117 
Strain-age hardening, 211, 215, 217-8 


Strain-anneal technique, for single crystals, 47 
Strain-hardening, see Work-hardening 
Strain ratio (R), 234 

Strength-probability-time (SPT) diagrams, 348 

Strengthening, see Hardening 

Stress: 

chemical, 89 

cycles (fatigue), 252-4 

flow, 203, 206, 219, 233, 235, 268 

intensity factor (K), 286 

nominal, 197 

Peierls-Nabarro, 207, 213, 288, 312 
relaxation in polymers, 351, 359-60 
resolved shear, 203-4, 220, 222 
tensor, 202 
true, 198 

yield, variation with temperature and strain rate, 208-9 
Stress-corrosion: 
ceramics, of, 348 
cracking (SCC), 386 
Stress-recrystallization, 342 
Stress-strain curve, 197, 226-7, 362 
Stretcher strain markings, 212 
Stroh (fracture) mechanism, 288 
Styrene-butadiene-rubber (SBR), 36, 354 
Superalloys: 

alloying elements, 305-8 
eutectic (NITAC, COTAC), 374 
hot corrosion of, 64 
Superconductors, 26, 181, 185-7 
Superlattice, see Ordering of atoms 
Superplasticity, 220-1, 317-8 
Surface (interfacial) energy: 
ductile failure, 293 
grain growth, 242-3 
hydrogen embrittlement, 291 
nucleation, influence on, 81-2 
particle-coarsening, 272-3 
sintering, 243 
triple junctions, 99 

Surgical materials see Prosthetic materials 
Swell, degree of, 359 
Symmetry in crystals: 
cubic, 17 
elements, 17 
Laue patterns, 135 
point and space groups, 18 
tetrahedral, 18 

Synchro-shear (Kronberg), 116 
Synchrotron radiation studies (SRS), 162-3 

Tacticity, in polymers; syndio-, iso- and atacticity, 35, 39 
Talc (French chalk), 9, 30 
TD (thoria-dispersed)-nickel, 250, 295, 307 
Temper embrittlement, 291 
Tempering of martensite, 282-3 
Tennis racket frames and strings, 407-9 
Tensile test, 198, see also Yielding, discontinuous 
Thermal analysis, 164-6 
Thermal expansion, 168-70, 372, 407 
Thermistor (PTC), 185 
Thermodynamic criteria for equilibrium: 
entropy, 49-50, 74-5, 374 
free energy, 49-50 

aqueous corrosion, 382-4 
oxidation of metals, 376-82 
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recrystallization, 239 
transformation, of, 171-2 
variation with temperature/composition, 50, 75 
heat content, 49 

Thermo-mechanical (THT) treatments, 283 
Thermocouple sheaths, 323 
Thermoforming of plastics, 356-8 
Thermogravimetric analysis (TGA), 164, 379 
Thermoplastics, 32-5 
adhesive, 415 
copolymer, 35 
flow defects 360-1 
molecular mass distribution, 33 
processing, 356-7 
structural repeat units, typical, 35 
tacticity, 35 

Thermosets, 36-8, 357-8, 375, 415 
Thompson tetrahedron: 
bee crystals, 112-3 
cph crystals, 108-12 
defect (stacking fault) tetrahedra, 111-2 
fee crystals 106-7 
ionic crystals, 114-5 
stair-rod dislocations, 107 
Throwing power, 388 

Time-temperature-transformation (TTT) diagrams, 274-6, 
281, 284 

Titanium alloys, 308-12, 410, 414 
o'-stabilizers, in, 308 
^-stabilizers, in, 308 
commercial, 310-11 
processing, 312 
Titanium aluminides, 314-15 
Tool tips, 329 
Transducers, 194 

Transformation-toughening (zirconia), 330 
Transformer laminations, 243 
Transistor, 184 

Transmutation of elements, 88 
Tresca yield criterion, 235-6 
Turbine blades: 
ceramic, 322 

directional solidification of, 46-7 
superalloys, 305-8 
Twins: 

annealing, 244 
boundary, 98-9 
boundary energy, 101 
comparison with slip, 203 
crystallography, 221-2 
dislocation (pole) mechanism, 223-4 
fracture, 224 
impurities, effect of, 223 
prestrain, effect of 223 
stacking sequence, 102, 244 
Tyres, automobile, 353 

Undercooling, 42 
constitutional, 54-5 
effect on rate of precipitation, 81 
Uranium, 122 

Urea-formaldehyde (U-F) resin, 36, 415 
Vacancies: 

activation energy of formation, 89 
activation energy of migration, 89 


chemical stress, 89-90 
clustering, 104, 120, 269 
creep mechanisms, 247-9 
definition, 84 
di- vacancies, 89 
diffusion of, 174-5 
emission, 122 

equilibrium concentration, 85 
fatigue, 254 

Frenkel defect (vacancy-interstitial pair), 86 

precipitation, role during, 268-71 

Schottky defect (vacancy pair), 86 

self-diffusion, 86, 174 

sinks, 85, 89, 117, 122 

sources, 85 

tri-vacancies, 89 

Vacuum melting and degassing, 45 
Vapour deposition: 

chemical (CVD), 323, 342, 368, 388 
physical (PVD), 388-90 
plasma-assisted chemical (PACVD), 388 
plasma-assisted physical (PAPVD), 390 
Varistor, 185 
Vegard’s Law, 168 
Viscoelasticity, 35, 351, 359 
Viscosity: 

glasses, 345-6 
thermoplastic melts, 358-9 
Voids: 

debonding in composites, 371 
ductile fracture, 104, 293 
fatigue failure, 256-8 
formation of, 90, 104 
gas-filled, 104 
irradiation swelling, 122-3 
sintering, 119 

von Mises yield criterion, 220, 287 
Vulcanization, 35 
Vycor glass, 48, 333 

Wagner-Hauffe Rule, 380 

Waste management (plastics), 354-5 

Wave number, 181 

Weld decay, 386 

Whiskers, 284, 369 

Wiedemann-Franz law, 181 

Widmanstatten structure, 53, 83, 262 

‘Windows’, ceramic, 196 

Wood: 

anisotropy, 407 
cell structure, 406 
properties, 407 
types, 407 

Work (strain)-hardening, 226-32 
brittle behaviour, 289-90 
creep, during, 246-7 
dislocation-dislocation interaction, 212 
jog theory, 93-4 
Lomer-Cottrell barrier, 103, 107 
relation to slip, 206 
Taylor model, 226 

X-ray diffraction: 
methods of analysis: 

diffractometry (XRD), 137-8 
Laue, 135, 261 
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X-ray diffraction: ( cont .) 
topography, 138 

powder (Debye-Scherrer), 136, 179 
principles: 

‘absent’ reflections, 139-40, 141 
asterism, 238, 267 
atomic scattering factor, 139 
Bragg law, 134 

determination of lattice parameters, 140 
determination of solvus, 140 
intensity of diffraction, 138-40 
line-broadening, 140-41 
reciprocal lattice, 141-2 
reflection (line) number (N), 134 
small angle scattering (SAS), 141, 261, 262 
structure-factor equation, 139-40 
X-rays: 

absorption coefficients, 133, 162 
characteristic, 133, 150-2 
continuous (‘white’) spectrum, 133, 161 
diffraction by crystals, 134 

extended X-ray absorption fine-structure spectroscopy 
(EXAFS), 163 
filtering, 134 

intensity measurement, 138 
scattering amplitudes, 162 


Si-Li detector, 151 

wavelength- and energy-dispersive spectrometers, 150-1 

YBCO (1-2-3 compound) superconductor, 187 
Yielding, discontinuous: 
effect of overstraining, 211 
effects of temperature and grain size, 217-8 
Liiders band formation, 211-2 
ordered alloys, 214 
strain age-hardening, 211 
yield point (upper, lower), 211 
Young’s modulus, 197, 203, 321 see also Property ratios 
Yttria, 326, 330 

Yttrium- aluminium-garnet (YAG), 328 

Z(AF) micro-analysis technique, 152 
Zener drag equation, 242-3 
Zeolites, 30 
Zirconia, 330-1 

fully-stabilized cubic (CSZ), 330 
partially-stabilized (PSZ), 330, 331 
refractories, 330-1 

tetragonal zirconia polycrystal (TZP), 330, 331 
toughened alumina (ZT(A)), 331 
Zone-refining, 46, 55-6, 185 



